WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 2117 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2117 
GCGGGCGACG CCATTGAGCC ACCTTCGAGA GCCCGC 



2) INFORMATION FOR SEQ ID NO: 2118 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2118 
GCGGGCTCCT TGACAATTTC TTCGTATCTG TTCTTGGCCC GC 



2) INFORMATION FOR SEQ ID NO: 2119 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2119 
GCGGGCTTAC AACCCTAAGG CTGTTCCATT CGTTGCCCGC 



2) INFORMATION FOR SEQ ID NO: 2120 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2120 
GCGGGCAGAA ACCAAGGCTG GTAAGGTTAC CGGTAGCCCG C 



2) INFORMATION FOR SEQ ID NO: 2121 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2121 
CGACCGCIAG CCGCACACCA AGTTCCGGTC G 
2) INFORMATION FOR SEQ ID NO: 2122 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2122 
CCGAGCGAAT GTAGGAGTCC AGGGTCTCTG CTCGG 



2) INFORMATION FOR SEQ ID NO: 2123 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2123 
TCTACCACTG AAGCATTAC 
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2) INFORMATION FOR SEQ ID NO: 2124 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2124 
TAGGTACTGT AGGTTTATTG 



2) INFORMATION FOR SEQ ID NO: 2125 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2125 
CACGCGGATT TTGAATCTCT TCCTCTAGTA GCGCGTG 



2) INFORMATION FOR SEQ ID NO: 2126 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2126 
ATATCAGAGA CTGATGAG 



2) INFORMATION FOR SEQ ID NO: 2127 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: ' Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2127 
TAGCATATTC AGAGAATATT GT 



2) INFORMATION FOR SEQ ID NO: 2128 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2128 
CAACGCTGGA GAATCTATAT TTGTAGAAAC TGCGTTG 
2) INFORMATION FOR SEQ ID NO: 2129 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8133 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: VPI 10463 

(C) ACCESSION NUMBER: X51797 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212 9 

ATGTCTTTAA TATCTAAAGA AGAGTTAATA AAACTCGCAT ATAGCATTAG 50 

ACCAAGAGAA AATGAGTATA AAACTATACT AACTAATTTA GACGAATATA 100 

ATAAGTTAAC TACAAACAAT AATGAAAATA AATATTTGCA ATTAAAAAAA 150 

CTAAATGAAT CAATTGATGT TTTTATGAAT AAATATAAAA CTTCAAGCAG 200 

AAATAGAGCA CTCTCTAATC TAAAAAAAGA TATATTAAAA GAAGTAATTC 250 

TTATTAAAAA TTCCAATACA AGCCCTGTAG AAAAAAATTT ACATTTTGTA 300 

TGGATAGGTG GAGAAGTCAG TGATATTGCT CTTGAATACA TAAAACAATG 3 50 

GGCTGATATT AATGCAGAAT ATAATATTAA ACTGTGGTAT GATAGTGAAG 400 

CATTCTTAGT AAATACACTA AAAAAGGCTA TAGTTGAATC TTCTACCACT 450 

GAAGCATTAC AGCTACTAGA GGAAGAGATT CAAAATCCTC AATTTGATAA 500 

TATGAAATTT TACAAAAAAA GGATGGAATT TATATATGAT AGACAAAAAA 550 

GGTTTATAAA TTATTATAAA TCTCAAATCA ATAAACCTAC AGTACCTACA 600 

ATAGATGATA TTATAAAGTC TCATCTAGTA TCTGAATATA ATAGAGATGA 650 

AACTGTATTA GAATCATATA GAACAAATTC TTTGAGAAAA ATAAATAGTA 700 

ATCATGGGAT AGATATCAGG GCTAATAGTT TGTTTACAGA ACAAGAGTTA 750 

TTAAATATTT ATAGTCAGGA GTTGTTAAAT CGTGGAAATT TAGCTGCAGC 800 
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ATCTGACATA GTAAGATTAT TAGCCCTAAA AAATTTTGGC GGAGTATATT ^850 
TAGATGTTGA TATGCTTCCA GGTATTCACT CTGATTTATT TAAAACAATA 900 
TCTAGACCTA GCTCTATTGG ACTAGACCGT TGGGAAATGA TAAAATTAGA 950 

GGCTATTATG AAGTATAAAA AATATATAAA TAATTATACA TCAGAAAACT 1000 

TTGATAAACT TGATCAACAA TTAAAAGATA ATTTTAAACT CATTATAGAA 1050 

AGTAAAAGTG AAAAATCTGA GATATTTTCT AAATTAGAAA ATTTAAATGT 1100 

ATCTGATCTT GAAATTAAAA TAGCTTTCGC TTTAGGCAGT GTTATAAATC 1150 

AAGCCTTGAT ATCAAAACAA GGTTCATATC TTACTAACCT AGTAATAGAA 1200 

CAAGTAAAAA ATAGATATCA ATTTTTAAAC CAACACCTTA ACCCAGCCAT 1250 

AGAGTCTGAT AATAACTTCA CAGATACTAC TAAAATTTTT CATGATTCAT 1300 

TATTTAATTC AGCTACCGCA GAAAACTCTA TGTTTTTAAC AAAAATAGCA 13 50 

CCATACTTAC AAGTAGGTTT TATGCCAGAA GCTCGCTCCA CAATAAGTTT 14 00 

AAGTGGTCCA GGAGCTTATG CGTCAGCTTA CTATGATTTC ATAAATTTAC 1450 

AAGAAAATAC TATAGAAAAA ACTTTAAAAG CATCAGATTT AATAGAATTT 1500 

AAATTCCCAG AAAATAATCT ATCTCAATTG ACAGAACAAG AAATAAATAG 1550 

TCTATGGAGC TTTGATCAAG CAAGTGCAAA ATATCAATTT GAGAAATATG 1600 

TAAGAGATTA TACTGGTGGA TCTCTTTCTG AAGACAATGG GGTAGACTTT 1650 

AATAAAAATA CTGCCCTCGA CAAAAACTAT TTATTAAATA ATAAAATTCC 1700 

ATCAAACAAT GTAGAAGAAG CTGGAAGTAA AAATTATGTT CATTATATCA 1750 

TACAGTTACA AGGAGATGAT ATAAGTTATG AAGCAACATG CAATTTATTT 1800 

TCTAAAAATC CTAAAAATAG TATTATTATA CAACGAAATA TGAATGAAAG 1850 

TGCAAAAAGC TACTTTTTAA GTGATGATGG AGAATCTATT TTAGAATTAA 1900 

ATAAATATAG GATACCTGAA AGATTAAAAA ATAAGGAAAA AGTAAAAGTA 1950 

ACCTTTATTG GACATGGTAA AGATGAATTC AACACAAGCG AATTTGCTAG 2000 

ATTAAGTGTA GATTCACTTT CCAATGAGAT AAGTTCATTT TTAGATACCA 2050 

TAAAATTAGA TATATCACCT AAAAATGTAG AAGTAAACTT ACTTGGATGT 2100 

AATATGTTTA GTTATGATTT TAATGTTGAA GAAACTTATC CTGGGAAGTT 2150 

GCTATTAAGT ATTATGGACA AAATTACTTC CACTTTACCT GATGTAAATA 2200 

AAAATTCTAT TACTATAGGA GCAAATCAAT ATGAAGTAAG AATTAATAGT 2250 

GAGGGAAGAA AAGAACTTCT GGCTCACTCA GGTAAATGGA TAAATAAAGA 2300 

AGAAGCTATT ATGAGCGATT TATCTAGTAA AGAATACATT TTTTTTGATT 2350 

CTATAGATAA TAAGCTAAAA GCAAAGTCCA AGAATATTCC AGGATTAGCA 2400 

TCAATATCAG AAGATATAAA AACATTATTA CTTGATGCAA GTGTTAGTCC 2450 

TGATACAAAA TTTATTTTAA ATAATCTTAA GCTTAATATT GAATCTTCTA 2 500 

TTGGGGATTA CATTTATTAT GAAAAATTAG AGCCTGTTAA AAATATAATT 2550 

CACAATTCTA TAGATGATTT AATAGATGAG TTCAATCTAC TTGAAAATGT 2600 

ATCTGATGAA TTATATGAAT TAAAAAAATT AAATAATCTA GATGAGAAGT 2650 

ATTTAATATC TTTTGAAGAT ATCTCAAAAA ATAATTCAAC TTACTCTGTA 2700 

AGATTTATTA ACAAAAGTAA TGGTGAGTCA GTTTATGTAG AAACAGAAAA 2750 

AGAAATTTTT TCAAAATATA GCGAACATAT TACAAAAGAA ATAAGTACTA 2800 

TAAAGAATAG TATAATTACA GATGTTAATG GTAATTTATT GGATAATATA 2850 

CAGTTAGATC ATACTTCTCA AGTTAATACA TTAAACGCAG CATTCTTTAT 2900 
TCAATCATTA ATAGATTATA GTAGCAATAA AGATGTACTG AATGATTTAA 2 950 
GTACCTCAGT TAAGGTTCAA CTTTATGCTC AACTATTTAG TACAGGTTTA 3 000 
AATACTATAT ATGACTCTAT CCAATTAGTA AATTTAATAT CAAATGCAGT 3050 
AAATGATACT ATAAATGTAC TACCTACAAT AACAGAGGGG ATACCTATTG 3100 
TATCTACTAT ATTAGACGGA ATAAACTTAG GTGCAGCAAT TAAGGAATTA 3150 
CTAGACGAAC ATGACCCATT ACTAAAAAAA GAATTAGAAG CTAAGGTGGG 3200 
TGTTTTAGCA ATAAATATGT CATTATCTAT AGCTGCAACT GTAGCTTCAA 3250 
TTGTTGGAAT AGGTGCTGAA GTTACTATTT TCTTATTACC TATAGCTGGT 33 00 

ATATCTGCAG GAATACCTTC ATTAGTTAAT AATGAATTAA TATTGCATGA 3350 
TAAGGCAACT TCAGTGGTAA ACTATTTTAA TCATTTGTCT GAATCTAAAA 3400 
AATATGGCCC TCTTAAAACA GAAGATGATA AAATTTTAGT TCCTATTGAT 3450 
GATTTAGTAA TATCAGAAAT AGATTTTAAT AATAATTCGA TAAAACTAGG 3500 
AACATGTAAT ATATTAGCAA TGGAGGGGGG ATCAGGACAC ACAGTGACTG 3550 
GTAATATAGA TCACTTTTTC TCATCTCCAT CTATAAGTTC TCATATTCCT 3600 
TCATTATCAA TTTATTCTGC AATAGGTATA GAAACAGAAA ATCTAGATTT 3 650 
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TTCAAAAAAA ATAATGATGT TACCTAATGC TCCTTCAAGA GTGTTTTGGT "3700 

GGGAAACTGG AGCAGTTCCA GGTTTAAGAT CATTGGAAAA TGACGGAACT 3750 

AGATTACTTG ATTCAATAAG AGATTTATAC CCAGGTAAAT TTTACTGGAG 3 800 

ATTCTATGCT TTTTTCGATT ATGCAATAAC TACATTAAAA CCAGTTTATG 3850 

AAGACACTAA TATTAAAATT AAACTAGATA AAGATACTAG AAACTTCATA 3 900 

ATGCCAACTA TAACTACTAA CGAAATTAGA AACAAATTAT CTTATTCATT 3950 

TGATGGAGCA GGAGGAACTT ACTCTTTATT ATTATCTTCA TATCCAATAT 4000 

CAACGAATAT AAATTTATCT AAAGATGATT TATGGATATT TAATATTGAT 4050 

AATGAAGTAA GAGAAATATC TATAGAAAAT GGTACTATTA AAAAAGGAAA 4100 

GTTAATAAAA GATGTTTTAA GTAAAATTGA TATAAATAAA AATAAACTTA 4150 

TTATAGGCAA TCAAACAATA GATTTTTCAG GCGATATAGA TAATAAAGAT 4200 

AGATATATAT TCTTGACTTG TGAGTTAGAT GATAAAATTA GTTTAATAAT 4250 

AGAAATAAAT CTTGTTGCAA AATCTTATAG TTTGTTATTG TCTGGGGATA 4300 

AAAATTATTT GATATCCAAT TTATCTAATA CTATTGAGAA AATCAATACT 4350 

TTAGGCCTAG ATAGTAAAAA TATAGCGTAC AATTACACTG ATGAATCTAA 44 00 

TAATAAATAT TTTGGAGCTA TATCTAAAAC AAGTCAAAAA AGCATAATAC 44 50 

ATTATAAAAA AGACAGTAAA AATATATTAG AATTTTATAA TGACAGTACA 4500 

TTAGAATTTA ACAGTAAAGA TTTTATTGCT GAAGATATAA ATGTATTTAT 4 550 

GAAAGATGAT ATTAATACTA TAACAGGAAA ATACTATGTT GATAATAATA 5000 

CTGATAAAAG TATAGATTTC TCTATTTCTT TAGTTAGTAA AAATCAAGTA 5050 

AAAGTAAATG GATTATATTT AAATGAATCC GTATACTCAT CTTACCTTGA 5100 

TTTTGTGAAA AATTCAGATG GACACCATAA TACTTCTAAT TTTATGAATT 5150 

TATTTTTGGA CAATATAAGT TTCTGGAAAT TGTTTGGGTT TGAAAATATA 5200 

AATTTTGTAA TCGATAAATA CTTTACCCTT GTTGGTAAAA CTAATCTTGG 5250 

ATATGTAGAA TTTATTTGTG ACAATAATAA AAATATAGAT ATATATTTTG 53 00 

GTGAATGGAA AACATCGTCA TCTAAAAGCA CTATATTTAG CGGAAATGGT 5350 

AGAAATGTTG TAGTAGAGCC TATATATAAT CCTGATACGG GTGAAGATAT 5400 

ATCTACTTCA CTAGATTTTT CCTATGAACC TCTCTATGGA ATAGATAGAT 5450 

ATATAAATAA AGTATTGATA GCAC CTGATT TATATACAAG TTTAATAAAT 5500 

ATTAATACCA ATTATTATTC AAATGAGTAC TACCCTGAGA TTATAGTTCT 5550 

TAACCCAAAT ACATTCCACA AAAAAGTAAA TATAAATTTA GATAGTTCTT 5600 

CTTTTGAGTA TAAATGGTCT ACAGAAGGAA GTGACTTTAT TTTAGTTAGA 5650 

TACTTAGAAG AAAGTAATAA AAAAATATTA CAAAAAATAA GAATCAAAGG 5700 

TATCTTATCT AATACTCAAT CATTTAATAA AATGAGTATA GATTTTAAAG 5750 

ATATTAAAAA ACTATCATTA GGATATATAA TGAGTAATTT TAAATCATTT 5800 

AATTCTGAAA ATGAATTAGA TAGAGATCAT TTAGGATTTA AAATAATAGA 5850 

TAATAAAACT TATTACTATG ATGAAGATAG TAAATTAGTT AAAGGATTAA 5900 

GCAAGTAATT GACAATAATA AGTATTATTT CAATCCTGAC ACTGCTATCA 5950 

TCTCAAAAGG TTGGCAGACT GTTAATGGTA GTAGATACTA CTTTGATACT 6000 

GATACCGCTA TTGCCTTTAA TGGTTATAAA ACTATTGATG GTAAACACTT 6050 

TTATTTTGAT AGTGATTGTG TAGTGAAAAT AGGTGTGTTT AGTACCTCTA 6100 

ATGGATTTGA ATATTTTGCA CCTGCTAATA CTTATAATAA TAACATAGAA 6150 

GGTCAGGCTA TAGTTTATCA AAGTAAATTC TTAACTTTGA ATGGTAAAAA 62 00 

ATATTACTTT GATAATAACT CAAAAGCAGT TACCGGATTG CAAACTATTG 62 50 

ATAGTAAAAA ATATTACTTT AATACTAACA CTGCTGAAGC AGCTACTGGA 6300 

TGGCAAACTA TTGATGGTAA AAAATATTAC TTTAATACTA ACACTGCTGA 6350 

AGCAGCTACT GGATGGCAAA CTATTGATGG TAAAAAATAT TACTTTAATA 64 00 

CTAACACTGC TATAGCTTCA ACTGGTTATA CAATTATTAA TGGTAAACAT 6450 

TTTTATTTTA ATACTGATGG TATTATGCAG ATAGGAGTGT TTAAAGGACC 6500 

TAATGGATTT GAATATTTTG CACCTGCTAA TACGGATGCT AACAACATAG 6550 

AAGGTCAAGC TATACTTTAC CAAAATGAAT TCTTAACTTT GAATGGTAAA 6 600 

AAATATTACT TTGGTAGTGA CTCAAAAGCA GTTACTGGAT GGAGAATTAT 6650 

TAACAATAAG AAATATTACT TTAATCCTAA TAATGCTATT GCTGCAATTC 6700 

ATCTATGCAC TATAAATAAT GACAAGTATT ACTTTAGTTA TGATGGAATT 6750 

CTTCAAAATG GATATATTAC TATTGAAAGA AATAATTTCT ATTTTGATGC 6800 

TAATAATGAA TCTAAAATGG TAACAGGAGT ATTTAAAGGA CCTAATGGAT 6850 

TTGAGTATTT TGCACCTGCT AATACTCACA ATAATAACAT AGAAGGTCAG 6 900 
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GCTATAGTTT ACCAGAACAA ATTCTTAACT 
TTTTGATAAT GACTCAAAAG CAGTTACTGG 
AAAAATATTA CTTTAATCTT AACACTGCTG 
ACTATTGATG GTAAAAAATA TTACTTTAAT 
TACTGGATGG CAAACTATTG ATGGTAAAAA 
CTTTGATAGC CTCAACTGGT TATACAAGTA 
TTTAATACTG ATGGTATTAT GCAGATAGGA 
ATTTGAATAC TTTGCACCTG CTAATACGGA 
AAGCTATACT TTACCAAAAT AAATTCTTAA 
TACTTTGGTA GTGACTCAAA AGCAGTTACC 
TAAAAAATAT TACTTTAATA CTAACACTGC 
AAACTATTAA TGGTAAAAAA TACTACTTTA 
TCAACTGGTT ATACAATTAT TAGTGGTAAA 
TGGTATTATG CAGATAGGAG TGTTTAAAGG 
TTGCACCTGC TAATACAGAT GCTAACAATA 
TATCAAAATA GATTCCTATA TTTACATGAC 
TAATTCAAAA GCGGCTACTG GTTGGGTAAC 
ACTTCGAGCC TAATACAGCT ATGGGTGCGA 
AATAAAAATT TTTACTTTAG AAATGGTTTA 
AGGGTCTAAT GGATTTGAAT ACTTTGCACC 
ATATAGAAGG TCAAGCTATA CGTTATCAAA 
GGAAAAATAT ATTACTTTGG TAATAATTCA 
AACTATTAAT GGTAAAGTAT ATTACTTTAT 
CAGCTGGTGG ACTTTTCGAG ATTGATGGTG 
GATGGAGTAA AAGCCCCTGG GATATATGGC 



PCT/CA00/01150 

TTGAATGGCA AAAAATATTA 6^9 50 

ATGGCAAACC ATTGATGGTA 7000 

AAGCAGCTAC TGGATGGCAA 7050 

CTTAACACTG CTGAAGCAGC 7100 

ATATTACTTT AATACTAACA 7150 

TTAATGGTAA ACATTTTTAT 7200 

GTGTTTAAAG GACCTAATGG 7250 

TGCTAACAAC ATAGAAGGTC 73 00 

CTTTGAATGG TAAAAAATAT 7350 

GGACTGCGAA CTATTGATGG 74 0 0 

TGTTGCAGTT ACTGGATGGC 74 50 

ATACTAACAC TTCTATAGCT 750 0 

CATTTTTATT TTAATACTGA 755 0 

ACCTGATGGA TTTGAATACT 7600 

TAGAAGGTCA AGCTATACGT 7650 

AATATATATT ATTTTGGTAA 7700 

TATTGATGGT AATAGATATT 7750 

ATGGTTATAA AACTATTGAT 7800 

CCTCAGATAG GAGTGTTTAA 7850 

TGCTAATACG GATGCTAACA 7 900 

ATAGATTCCT ACATTTACTT 7950 

AAAGCAGTTA CTGGATGGCA 80 0 0 

GCCTGATACT GCTATGGCTG 8 050 

TTATATATTT CTTTGGTGTT 810 0 

TAA 8133 



2) INFORMATION FOR SEQ ID NO: 2130 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: VPI 10463 

(C) ACCESSION NUMBER: X53138 

(xi) SEQUENCE DESCRIPTION: SE 



ID NO: 2130 






AAAATGGCAA 


ATGTAAGATT 


50 


ATTGGATGCT 


TTAGAAGAAT 


100 


AAAAATATTT 


AAAATTAAAA 


150 


GATACATATA 


AAAAATCTGG 


200 


ATATCTAGTT 


ACAGAAGTAT 


250 


TTGAGAAAAA 


TTTACATTTT 


300 


GCTATTAATT 


ATATAAATCA 


350 


TAATGTTTTT 


TATGATAGTA 


400 


CTGTAGTAGA 


ATCAGCAATA 


450 


TTAAATGACC 


CTAGATTTGA 


500 


AATAATTTAT 


GATAAACAGA 


550 


GAGAAGAAAA 


TCCTGAACTT 


600 



1097 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

ATAATTGATG ATATTGTAAA GACATATCTT TCAAATGAGT ATTCAAAGGA 6 50 

GATAGATGAA CTTAATACCT ATATTGAAGA ATCCTTAAAT AAAATTACAC 700 

AGAATAGTGG AAATGATGTT AGAAACTTTG AAGAATTTAA AAATGGAGAG 750 

TCATTCAACT TATATGAACA AGAGTTGGTA GAAAGGTGGA ATTTAGCTGC 800 

TGCTTCTGAC ATATTAAGAA TATCTGCATT AAAAGAAATT GGTGGTATGT 850 

ATTTAGATGT TGATATGTTA CCAGGAATAC AACCAGACTT ATTTGAGTCT 900 

ATAGAGAAAC CTAGTTCAGT AACAGTGGAT TTTTGGGAAA TGACAAAGTT 950 

AGAAGCTATA ATGAAATACA AAGAATATAT ACCAGAATAT ACCTCAGAAC 1000 

ATTTTGACAT GTTAGACGAA GAAGTTCAAA GTAGTTTTGA ATCTGTTCTA 1050 

GCTTCTAAGT CAGATAAATC AGAAATATTC TCATCACTTG GTGATATGGA 1100 

GGCATCACCA CTAGAAGTTA AAATTGCATT TAATAGTAAG GGTATTATAA 1150 

ATCAAGGGCT AATTTCTGTG AAAGACTCAT ATTGTAGCAA TTTAATAGTA 1200 

AAACAAATCG AGAATAGATA TAAAATATTG AATAATAGTT TAAATCCAGC 1250 

TATTAGCGAG GATAATGATT TTAATACTAC AACGAATACC TTTATTGATA 13 00 

GTATAATGGC TGAAGCTAAT GCAGATAATG GTAGATTTAT GATGGAACTA 1350 

GGAAAGTATT TAAGAGTTGG TTTCTTCCCA GATGTTAAAA CTACTATTAA 14 00 

CTTAAGTGGC CCTGAAGCAT ATGCGGCAGC TTATCAAGAT TTATTAATGT 1450 

TTAAAGAAGG CAGTATGAAT ATCCATTTGA TAGAAGCTGA TTTAAGAAAC 1500 

TTTGAAATCT CTAAAACTAA TATTTCTCAA TCAACTGAAC AAGAAATGGC 1550 

TAGCTTATGG TCATTTGACG ATGCAAGAGC TAAAGCTCAA TTTGAAGAAT 1600 

ATAAAAGGAA TTATTTTGAA GGTTCTCTTG GTGAAGATGA TAATCTTGAT 1650 

TTTTCTCAAA ATATAGTAGT TGACAAGGAG TATCTTTTAG AAAAAATATC 1700 

TTCATTAGCA AGAAGTTCAG AGAGAGGATA TATACACTAT ATTGTTCAGT 1750 

TACAAGGAGA TAAAATTAGT TATGAAGCAG CATGTAACTT ATTTGCAAAG 1800 

ACTCCTTATG ATAGTGTACT GTTTCAGAAA AATATAGAAG ATTCAGAAAT 1850 

TGCATATTAT TATAATCCTG GAGATGGTGA AATACAAGAA ATAGACAAGT 1900 

ATAAAATTCC AAGTATAATT TCTGATAGAC CTAAGATTAA ATTAACATTT 1950 

ATTGGTCATG GTAAAGATGA ATTTAATACT GATATATTTG CAGGTTTTGA 2 000 

TGTAGATTCA TTATCCACAG AAATAGAAGC AGCAATAGAT TTAGCTAAAG 2050 

AGGATATTTC TCCTAAGTCA ATAGAAATAA ATTTATTAGG ATGTAATATG 2100 

TTTAGCTACT CTATCAACGT AGAGGAGACT TATCCTGGAA AATTATTACT 2150 

TAAAGTTAAA GATAAAATAT CAGAATTAAT GCCATCTATA AGTCAAGACT 22 00 

CTATTATAGT AAGTGCAAAT CAATATGAAG TTAGAATAAA TAGTGAAGGA 2250 

AGAAGAGAAT TATTGGATCA TTCTGGTGAA TGGATAAATA AAGAAGAAAG 2300 

TATTATAAAG GATATTTCAT CAAAAGAATA TATATCATTT AATCCTAAAG 2350 

AAAATAAAAT TACAGTAAAA TCTAAAAATT TACCTGAGCT ATCTACATTA 2400 

TTACAAGAAA TTAGAAATAA TTCTAATTCA AGTGATATTG AACTAGAAGA 2450 

AAAAGTAATG TTAACAGAAT GTGAGATAAA TGTTATTTCA AATATAGATA 2500 

CGCAAATTGT TGAGGAAAGG ATTGAAGAAG CTAAGAATTT AACTTCTGAC 2550 

TCTATTAATT ATATAAAAGA TGAATTTAAA CTAATAGAAT CTATTTCTGA 26 00 

TGCACTATGT GACTTAAAAC AACAGAATGA ATTAGAAGAT TCTCATTTTA 2650 

TATCTTTTGA GGACATATCA GAGACTGATG AGGGATTTAG TATAAGATTT 2700 

ATTAATAAAG AAACTGGAGA ATCTATATTT GTAGAAACTG AAAAAACAAT 2750 

ATTCTCTGAA TATGCTAATC ATATAACTGA AGAGATTTCT AAGATAAAAG 2800 
GTACTATATT TGATACTGTA AATGGTAAGT TAGTAAAAAA AGTAAATTTA 2850 
GATACTACAC ACGAAGTAAA TACTTTAAAT GCTGCATTTT TTATACAATC 2900 
ATTAATAGAA TATAATAGTT CTAAAGAATC TCTTAGTAAT TTAAGTGTAG 2950 
CAATGAAAGT CCAAGTTTAC GCTCAATTAT TTAGTACTGG TTTAAATACT 3 00 0 

ATTACAGATG CAGCCAAAGT TGTTGAATTA GTATCAACTG CATTAGATGA 3 050 

AACTATAGAC TTACTTCCTA CATTATCTGA AGGATTACCT ATAATTGCAA 3100 
CTATTATAGA TGGTGTAAGT TTAGGTGCAG CAATCAAAGA GCTAAGTGAA 3150 
ACGAGTGACC CATTATTAAG ACAAGAAATA GAAGCTAAGA TAGGTATAAT 3200 
GGCAGTAAAT TTAACAACAG CTACAACTGC AATCATTACT TCATCTTTGG 3250 
GGATAGCTAG TGGATTTAGT ATACTTTTAG TTCCTTTAGC AGGAATTTCA 3300 
GCAGGTATAC CAAGCTTAGT AAACAATGAA CTTGTACTTC GAGATAAGGC 33 50 

AACAAAGGTT GTAGATTATT TTAAACATGT TTCATTAGTT GAAACTGAAG 3400 
GAGTATTTAC TTTATTAGAT GATAAAATAA TGATGCCAGA AGATGATTTA 3450 
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GTGATATCAG AAATAGATTT TAATAATAAT TCAATAGTTT TAGGTAAATG 3500 

TGAAATCTGG AGAATGGAAG GTGGTTCAGG TCATACTGTA ACTGATGATA 3 5 50 

TAGATCACTT CTTTTCAGCA CCATCAATAA CATATAGAGA GCCACACTTA 36 00 

TCTATATATG ACGTATTGGA AGTACAAAAA GAAGAACTTG ATTTGTCAAA 3650 

AGATTTAATG GTATTACCTA ATGCTCCAAA TAGAGTATTT GCTTGGGAAA 3700 

CAGGATGGAC ACCAGGTTTA AGAAGCTTAG AAAATGATGG CACAAAACTG 3750 

TTAGACCGTA TAAGAGATAA CTATGAAGGT GAGTTTTATT GGAGATATTT 3800 

TGCTTTTATA GCTGATGCTT TAATAACAAC ATTAAAACCA AGATATGAAG 3850 

ATACTAATAT AAGAATAAAT TTAGATAGTA ATACTAGAAG TTTTATAGTT 3 900 

CCAATAATAA CTACAGAATA TATAAGAGAA AAATTATCAT ATTCTTTCTA 3950 

TGGTTCAGGA GGAACTTATG CATTGTCTCT TTCTCAATAT AATATGGGTA 4000 

TAAATATAGA ATTAAGTGAA AGTGATGTTT GGATTATAGA TGTTGATAAT 4050 

GTTGTGAGAG ATGTAACTAT AGAATCTGAT AAAATTAAAA AAGGTGATTT 4100 

AATAGAAGGT ATTTTATCTA CACTAAGTAT TGAAGAGAAT AAAATTATCT 4150 

TAAATAGCCA TGAGATTAAT TTTTCTGGTG AGGTAAATGG AAGTAATGGA 4200 

TTTGTTTCTT TAACATTTTC AATTTTAGAA GGAATAAATG CAATTATAGA 42 50 

AGTTGATTTA TTATCTAAAT CATATAAATT ACTTATTTCT GGCGAATTAA 4300 

AAATATTGAT GTTAAATTCA AATCATATTC AACAGAAAAT AGATTATATA 4350 

GGATTCAATA GCGAATTACA GAAAAATATA CCATATAGCT TTGTAGATAG 4400 

TGAAGGAAAA GAGAATGGTT TTATTAATGG TTCAACAAAA GAAGGTTTAT 4450 

TTGTATCTGA ATTACCTGAT GTAGTTCTTA TAAGTAAGGT TTATATGGAT 4500 

GATAGTAAGC CTTCATTTGG ATATTATAGT AATAATTTGA AAGATGTCAA 4550 

AGTTATAACT AAAGATAATG TTAATATATT AACAGGTTAT TATCTTAAGG 4600 

ATGATATAAA AATCTCTCTT TCTTTGACTC TACAAGATGA AAAAACTATA 4650 

AAGTTAAATA GTGTGCATTT AGATGAAAGT GGAGTAGCTG AGATTTTGAA 4700 

GTTCATGAAT AGAAAAGGTA ATACAAATAC TTCAGATTCT TTAATGAGCT 4750 

TTTTAGAAAG TATGAATATA AAAAGTATTT TCGTTAATTT CTTACAATCT 4800 

AATATTAAGT TTATATTAGA TGCTAATTTT ATAATAAGTG GTACTACTTC 4 850 

TATTGGCCAA TTTGAGTTTA TTTGTGATGA AAATGATAAT ATACAACCAT 4900 

ATTTCATTAA GTTTAATACA CTAGAAACTA ATTATACTTT ATATGTAGGA 4950 

AATAGACAAA ATATGATAGT GGAACCAAAT TATGATTTAG ATGATTCTGG 5000 

AGATATATCT TCAACTGTTA TCAATTTCTC TCAAAAGTAT CTTTATGGAA 5050 

TAGACAGTTG TGTTAATAAA GTTGTAATTT CACCAAATAT TTATACAGAT 5100 

GAAATAAATA TAACGCCTGT ATATGAAACA AATAATACTT ATCCAGAAGT 5150 

TATTGTATTA GATGCAAATT ATATAAATGA AAAAATAAAT GTTAATATCA 5200 

ATGATCTATC TATACGATAT GTATGGAGTA ATGATGGTAA TGATTTTATT 5250 

CTTATGTCAA CTAGTGAAGA AAATAAGGTG TCACAAGTTA AAATAAGATT 53 00 

CGTTAATGTT TTTAAAGATA AGACTTTGGC AAATAAGCTA TCTTTTAACT 5350 

TTAGTGATAA ACAAGATGTA CCTGTAAGTG AAATAATCTT ATCATTTACA 5400 

CCTTCATATT ATGAGGATGG ATTGATTGGC TATGATTTGG GTCTAGTTTC 5450 

TTTATATAAT GAGAAATTTT ATATTAATAA CTTTGGAATG ATGGTATCTG 5500 

GATTAATATA TATTAATGAT TCATTATATT ATTTTAAACC ACCAGTAAAT 5550 

AATTTGATAA CTGGATTTGT GACTGTAGGC GATGATAAAT ACTACTTTAA 5600 

TCCAATTAAT GGTGGAGCTG CTTCAATTGG AGAGACAATA ATTGATGACA 5650 

AAAATTATTA TTTCAACCAA AGTGGAGTGT TACAAACAGG TGTATTTAGT 5700 

ACAGAAGATG GATTTAAATA TTTTGCCCCA GCTAATACAC TTGATGAAAA 5750 

CCTAGAAGGA GAAGCAATTG ATTTTACTGG AAAATTAATT ATTGACGAAA 5800 

ATATTTATTA TTTTGATGAT AATTATAGAG GAGCTGTAGA ATGGAAAGAA 5850 

TTAGATGGTG AAATGCACTA TTTTAGCCCA GAAACAGGTA AAGCTTTTAA 5900 

AGGTCTAAAT CAAATAGGTG ATTATAAATA CTATTTCAAT TCTGATGGAG 5950 

TTATGCAAAA AGGATTTGTT AGTATAAATG ATAATAAACA CTATTTTGAT 6000 

GATTCTGGTG TTATGAAAGT AGGTTACACT GAAATAGATG GCAAGCATTT 6050 

CTACTTTGCT GAAAACGGAG AAATGCAAAT AGGAGTATTT AATACAGAAG 6100 

ATGGATTTAA ATATTTTGCT CATCATAATG AAGATTTAGG AAATGAAGAA 6150 

GGTGAAGAAA TCTCATATTC TGGTATATTA AATTTCAATA ATAAAATTTA 6200 

CTATTTTGAT GATTCATTTA CAGCTGTAGT TGGATGGAAA GATTTAGAGG 6250 

ATGGTTCAAA GTATTATTTT GATGAAGATA CAGCAGAAGC ATATATAGGT 6300 
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TTGTCATTAA TAAATGATGG 
GCAAGTTGGA TTTGTCACTA 
CTGGAATTAT AGAATCTGGA 
ATAGATGATA ATGGTATAGT 
ATATAAATAT TTTGCACCTG 
AAGCAGTTGA ATATAGTGGT 
TTTGGAGAAA CATATACAAT 
TGAAAGTGAT AAATATTATT 
GTATTAATTT AATTGATGAT 
ATGAGAACGG GTCTTATATC 
GAATGGTGAA ATGCAATTTG 
ATTTTGGTGA AGATGGTGTC 
GGATTTAAAT ACTTTGCACA 
AGAATCAATA AACTATACTG 
ATTTTACAGA TGAATATATT 
GAGGAGTATT ATTTTGATCC 
G 



TCAATATTAT TTTAATGATG 
TAAATGATAA AGTCTTCTAC 
GTACAAAACA TAGATGACAA 
TCAAATTGGT GTATTTGATA 
CTAATACTGT AAATGATAAT 
TTAGTTAGAG TTGGGGAAGA 
TGAGACTGGA TGGATATATG 
TCAATCCAGA AACTAAAAAA 
ATAAAATATT ATTTTGATGA 
ATTTGAAAAT AATAATTATT 
GTTATATAAA TATAGAAGAT 
ATGCAGATTG GAGTATTTAA 
TCAAAATACT TTGGATGAGA 
GTTGGTTAGA TTTAGATGAA 
GCAGCAACTG GTTCAGTTAT 
TGATACAGCT CAATTAGTGA 



ATGGAATTAT 


6350 


TTCTCTGACT 


6400 


TTATTTCTAT 


6450 


CTTCAGATGG 


6500 


ATTTACGGAC 


6550 


TGTATATTAT 


6600 


ATATGGAAAA 


6650 


GCATGCAAAG 


6700 


GAAGGGCATA 


6750 


ACTTTAATGA 


6800 


AAGATGTTCT 


6850 




6900 


ATTTTGAGGG 


6950 


AAGAGATATT 


7000 


TATTGATGGT 


7050 


TTAGTGAATA 


7100 




7101 



2) INFORMATION FOR SEQ ID NO: 2131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2131 
AAGCMATTGT TGTAAATTTT GAAAG 



2) INFORMATION FOR SEQ ID NO: 2132 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2132 
TCATATCCAT AGCAATAGTT CTA 



2) INFORMATION FOR SEQ ID NO: 2133 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2133 
CCACGCACAW AAACTTGTTT TAGAAGTAGC AGCWCAGCGT GG 



2) INFORMATION FOR SEQ ID NO: 2134 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2134 
CGCGTGAAGC TTCTGTG 



2) INFORMATION FOR SEQ ID NO: 2135 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2135 
TCTCGCCGTT ATTCAGTTTC 



2) INFORMATION FOR SEQ ID NO: 2136 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2136 
CCAACGCGTC CTCAATCATT TCTAACTTCT ATGGCCGGCG TTGG 
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2) INFORMATION FOR SEQ ID NO: 2137 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 889 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas putida 

(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2137 

ATGATCACCG GTGCTGCCCA GATGGACGGC GCGATCCTGG TTTGCTCGGC 50 

CGCCGATGGT CCGATGCCrC AAACCCGTGA GCACATCCTG CTGTCCCGTC 100 

AGGTAGGCGT TCCGTACATC GTGGTCTTCC TGAACAAGGC CGACCTGGTA 150 

GACGACGCTG AGCTGCTGGA ACTGGTCGAG ATGGAAGTTC GCGACCTGCT 2 00 

GTCCACCTAC GACTTCCCAG GCGACGACAC TCCGATCATC ATCGGTTCCG 250 

CTCGTATGGC CCTGGAAGGC AAAGACGACA ACGAAATGGG CACTACCGCT 3 00 

GTCAAGAAGC TGGTAGAAAC TCTGGATGCC TACATCCCTG AGCCAGTTCG 3 50 

TGCCATCGAC CAGCCGTTCC TGATGCCGAT CGAAGACGTG TTCTCGATCT 400 

CGGGTCGTGG TACCGTTGTT ACCGGTCGTA TCGAGCGTGG CATCGTTCGC 450 

GTTCAGGATC CGCTGGAAAT CGTTGGTCTG CGTGACACCA CCACCACCAC 500 

CTGCACCGGT GTTGAGATGT TCCGCAAGCT GCTGGACGAA GGTCGTGCTG 550 

GCGAGAACTG CGGCGTTCTG CTGCGTGGTA CCAAGCGTGA CGACGTTGAG 600 

CGTGGCCAGG TTCTGGTCAA GCCAGGTTCG GTCAAGCCGC ACACCAAGTT 650 

CACCGCAGAA GTCTACGTCC TGTCGAAGGA AGAAGGCGGC CGTCACACTC 70 0 

CGTTCTTCAA AGGCTACCGT CCACAGTTCT ACTTCCGTAC CACTGACGTG 750 

ACCGGTAACT GCGAACTGCC GGAAGGCGTT GAAATGGTAA TGCCAGGTGA 800 

CAACATTCAG ATGACTGTTA CCCTGATCAA GACCATCGCA ATGGAAGACG 850 

GTCTGCGCTT CGCTATCCGT GAAGGCGGTC GTACCGTCG 889 



2) INFORMATION FOR SEQ ID NO: 2138 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: PAO-1 

(C) ACCESSION NUMBER: Genome project 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2138 
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AACATGATCA CCGGTGCTGC CCAGATGGAC GGCGCGATCC TGGTTTGCTC 50 

GGCTGCCGAC GGCCCCATGC CGCAGACCCG CGAGCACATC CTGCTGTCCC 100 

GCCAGGTAGG CGTTCCCTAC ATCGTCGTGT TCCTGAACAA GGCCGACATG 150 

GTCGACGACG CCGAGCTGCT GGAACTGGTC GAGATGGAAG TTCGCGATCT 200 

GCTGAACACC TACGACTTCC CGGGCGACGA CACTCCGATC ATCATCGGTT 250 

CCGCGCTGAT GGCGCTGGAA GGCAAGGATG ACAACGGCAT CGGCGTAAGC 3 00 

GCCGTGCAGA AGCTGGTAGA GACCCTGGAC TCCTACATTC CGGAGCCGGT 350 

TCGTGCCATC GACCAGCCGT TCCTGATGCC GATCGAAGAC GTGTTCTCGA 4 00 

TCTCCGGCCG CGGTACCGTG GTAACCGGTC GTGTAGAGCG CGGCATCATC 450 

AAGGTCCAGG AAGAAGTGGA AATCGTCGGC ATCAAGGCGA CCACCAAGAC 500 

TACCTGCACC GGCGTTGAAA TGTTCCGCAA GCTGCTCGAC GAAGGTCGTG 550 

CTGGTGAGAA CGTTGGTATC CTGCTGCGTG GCACCAAGCG TGAAGACGTA 600 

GAGCGTGGCC AGGTTCTGGC CAAGCCGGGC ACCATCAAGC CGCACACCAA 650 

GTTCGAGTGC GAAGTGTACG TGCTGTCCAA GGAAGAAGGT GGTCGTCACA 700 

CCCCGTTCTT CAAGGGCTAC CGTCCGCAGT TCTACTTCCG TACCACCGAC 750 

GTGACCGGTA ACTGCGAACT GCCGGAAGGC GTAGAGATGG TAATGCCGGG 800 

CGACAACATC AAGATGGTTG TCACCCTGAT CGCTCCGATC GCCATGGAAG 850 

ATGGCCTGCG CTTCGCGATC CGCGAAGGCG GCCGTACCGT TGGCGCCGGC 900 



2) INFORMATION FOR SEQ ID NO: 2139 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1398 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni 

(B) STRAIN: NCTC 11168 

(C) ACCESSION NUMBER: AL139074 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2139 

ATGCAAGGAT TTATTTCACA GGTATTAGGT CCGGTTGTTG ATGTAGATTT 50 

TAACGACTAT TTGCCTCAAA TTAATGAAGC AATTGTTGTA AATTTTGAAA 100 

GCGAAGGAAA AAAACATAAA CTTGTTTTAG AAGTAGCAGC TCATTTAGGA 150 

GATAATAGAG TTAGAACTAT TGCTATGGAT ATGACAGATG GTTTGGTAAG 200 

GGGCTTAAAA GCTGAGGCTT TAGGTGCTCC CATTAGTGTT CCTGTTGGTG 250 

AGAAAGTTTT AGGAAGAATT TTCAATGTTA CTGGAGATTT GATCGATGAA 300 

GGTGAAGAAA TTTCTTTCGA TAAAAAATGG GCAATTCATA GAGATCCACC 3 50 

AGCTTTTGAA GATCAAAGCA CAAAAAGTGA GATTTTTGAA ACAGGGATTA 400 

AAGTTGTAGA TTTACTTGCT CCTTATGCAA AAGGTGGTAA AGTAGGTCTT 450 

TTTGGTGGTG CAGGTGTTGG TAAAACTGTT ATTATTATGG AGCTTATTCA 500 

CAATGTTGCA TTTAAGCATA GCGGCTATTC TGTATTTGCA GGTGTGGGTG 550 

AGAGAACTCG TGAAGGAAAT GACCTTTATA ATGAAATGAA AGAAAGTAAT 600 

GTTTTAGACA AAGTTGCTCT ATGTTATGGA CAAATGAATG AACCACCAGG 650 

GGCAAGAAAT CGTATTGCTT TAACAGGTTT AACAATGGCT GAGTATTTTA 700 

GAGATGAAAT GGGTCTTGAT GTGCTTATGT TTATTGATAA TATCTTTAGA 750 

TTTTCACAAT CAGGTTCTGA AATGTCAGCA CTTTTAGGAA GAATTCCATC 800 

AGCTGTGGGT TATCAACCAA CCTTAGCAAG TGAAATGGGT AAATTCCAAG 850 

AAAGAATTAC TTCAACTAAA AAAGGCTCAA TCACTTCAGT TCAAGCTGTT 900 

TACGTTCCAG CTGATGACTT AACAGACCCA GCTCCAGCAA CTGTTTTTGC 950 
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TCACTTAGAT GCTACAACGG TTTTAAATAG 
TTTATCCTGC AGTTGACCCA CTTGACTCAA 
AATATCATTG GAGAAGAACA TTATAAAGTT 
GCTTCAAAAA TACAAAGATT TACAAGATAT 
ATGAGCTTAG CGAAGAGGAT AAACTTGTAG 
GAAAAATTCT TATCACAACC ATTTTTCGTT 
CCCAGGAAAA TATATAAGCC TTGAAGATAC 
TTTTAGAAGG TAAATATGAT CATTTACCAG 
GGAAATATAG ATGAAGCTAT TGCAAAAGCG 



PCT/CA00/01150 

AGCTATTGCT GAAAAGGGTA 100 0 

CTTCAAGAAT GCTTGACCCA 10 50 

GCTCGTGGTG TTCAATCAGT 1100 

CATCGCCATT TTAGGTATGG 1150 

TTGAAAGAGC AAGAAAGATT 12 0 0 

GCAGAAGTTT TCACAGGTAG 1250 

AATAGCAGGA TTTAAGGGAA 13 00 

AAAATGCTTT CTATATGGTT 13 50 

GATAAATTAA AAGGTTAA 13 98 



2) INFORMATION FOR SEQ ID NO: 2140 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2157 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 670 

(C) ACCESSION NUMBER: X67867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2140 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 2 00 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 2 50 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 3 00 

ATTGATTCCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGTAA 3 50 

TTCCCTCCAA GGTGGATCAA CTCTCACCCA ACAGTTGATT AAGTTGACTT 4 00 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 4 50 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAATCTT 5 00 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 6 00 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 6 50 

CCCCTATTCA CATCCAGAAG CAGCCCAAGA CCGCCGAAAC TTGGTCTTAT 7 00 

CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 750 

GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 

TTACCCTGCT TACATGGATA ATTACCTCAA GGAGGTCATC AATCAAGTAG 8 50 

AACAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 

AATGTAGACC AAGAAGCTCA AAAACATCTG TGGGATATCT ACAACTCCGA 950 

TCAATACGTC TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG 10 0 0 

TAGATGTTTC AAATGGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA 10 50 

GCAAGTAACG TTTCATTTGG TACCAACCAA GCTGTGGAAA CCAATCGTGA 1100 

CTGGGGTTCT GCTATGAAAC CAATCACCGA TTATGCACCT GCCATAGAAT 1150 

ACGGTGTTTA TGATTCCACT GCAACTATGG TTAATGATAT TCCTTATAAC 12 00 

TATCCGGGAA CAAGCACACC TGTCTACAAC TGGGATAGAG CATATTTCGG 12 50 

TAATATTACT CTGCAATATG CTCTTCAACA ATCACGAAAT GTCACAGCCG 13 00 

TTGAGACTTT GAATAAGGTC GGTCTAGATA GAGCTAAAAC CTTCCTTAAT 13 50 

GGTCTTGGTA TCGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG 14 00 

1104 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TAATACAACA GAATCTAATA 
CTGCTGCTTA TGCTGCCTTT 
TATATCAATA AGGTCGTCTT 
TGTAGGTACA CGAGCTATGA 
TGATGAAAAC TGTCTTGGCA 
TGGTTAGCGC AAGCTGGTAA 
AATTGAAAAA CACATCAAGA 
TTGTTGGTTA TACTCGTAAG 
AATCGTTTAA CTCCTATCGT 
TTATCGCTCA ATGATAACGT 
GGACGATGCC AGACGGACTT 
GGAGCTCGTT CTACGTGGAG 
TGAAAGTTCA AGCTCATCAT 
CCACTCCAAG CACAAATAAT 
CAACAATCAA ATACAACCCC 
ACAACCA 



AACAATACGG AGCAAGTAGT 
GCCAACGGTG GAATTTACCA 
CAGTGACGGT AGTAAAAAAG 
AAGAAACAAC TGCTTACATG 
TACGGAACTG GTCGTGGAGC 
GACAGGTACT TCTAACTACA 
ACACTGGCTA TGTAGCTCCA 
TATTCTATGG CTGTATGGAC 
TGGAGATGGT TTCCTAGTTG 
ATCTATCAGA AGATACTCAT 
TTCAGAAACG GGGAATTTGT 
CTCACCTGCT CCACAACAAC 
CAGATAGTTC AACTTCACAG 
AGTACGACTA CCGATCCTAA 
TGATCAACAA AATCAGAATC 
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GAAAAAATGG 


1450 


CAAACCCATG 


1500 


AATTTTCAGA 


1550 


ATGACCGAAA 


1600 


CTATCTCCCA 


1650 


CAGATGATGA 


1700 


GATGAAATGT 


1750 


AGGTTATTCG 


1800 


CAGCTAAAGT 


1850 


CCAGAAGACT 


1900 


ATTCAAAAAT 


1950 


CCCCATCAAC 


2000 


TCTAGCTCAA 


2050 


CAATAATACG 


2100 


CTCAACCAGC 


2150 




2157 



2) INFORMATION FOR SEQ ID NO: 2141 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2141 
CACCAGTCAC AGAAAAGC 



2) INFORMATION FOR SEQ ID NO: 2142 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2142 
GCCTTAATTT CGGATAGTGC 



2) INFORMATION FOR SEQ ID NO: 2143 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2143 
GAGAAAGAGC CCAATTATCT AATGT 25 

2) INFORMATION FOR SEQ ID NO: 2144 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3075 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: J2870 

(C) ACCESSION NUMBER: X75439 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2144 

TTGACAAAGA AATATTTAAA CACCCAGAAT GAAATATCAG CATTTTGGAA 50 

TACTCAAAAG ATATTTAAAA AATCAATTGA CAATAGAAAA GGACAGGAAA 100 

GTTTTGTTTT TTATGACGGC CCCCCAACTG CAAATGGCCT TCCTCATGCT 150 

GGCCATGTTC TTGGAAGAGT AATCAAGGAT TTAGTTGCAA GATTAAAAAC 200 

TATGCAAGGT TTTTATGTAG AAAGAAAAGC AGGATGGGAT ACCCATGGCT 250 

TACCAGTTGA ATTAGAGGTT GAAAAAAAAA TTGGAATTAA AGGAAAACAA 3 00 

GACATTGAAA AGTATGGAAT AGAAAATTTT ATAAATGAAT GTAAAAAAAG 350 

TGTATTTAAT TATGAAAAAG AATGGCGGGA TTTTTCTAAA GATTTAGGAT 400 

ACTGGGTTGA CATGGACTCC CCCTATATAA CTCTTGAGAA TAATTATATT 450 

GAAAGTGTAT GGAATATATT ATCTACATTC CATAAAAAAG GACTATTATA 500 

TAAGGGACAT AAGGTGACTC CTTATTGTAC ACATGATCAA ACCGCTTTAA 550 

GTTCTCATGA AGTAGCGCAA GGCTATAAAA ACGTTAAAGA TTTATCAGCT 6 00 

GTTGTTAAAT TTCAACTTAC AAATAGTAAA GATACTTATT TCTTAAGTTG 650 

GACTACCACT CCCTGGACTT TGCCTGCAAA TGTAGCATTA GCTATAAATA 700 

AAGATCTTAA TTATTCAAAA ATTCGGGTAG AAAATGAGTA TTATATCTTA 750 

GCTACAGATC TAATTAATTC TATAATAACT GAAAAATACG AAATTATTGA 800 

TACCTTTTCA GGAAGTAATT TAATTAATTT AAAATACATT CCTCCTTTTG 850 

AAAGCGACGG TTTAGTTAAT GCATATTACG TTGTTGATGG AGAATTTGTT 900 

ACTAACTCAG AAGGAACTGG TATTGTTCAT ATAGCACCAG CTCATGGGGA 950 

AGATGACTAC CAATTGGTTT TAGAGCGTGA TTTGGATTTC TTAAATGTTA 1000 

TAACAAGAGA AGGAGTATAT AATGATAGGT TCCCTGAATT AGTTGGTAAT 1050 

AAAGCTAAAA ATAGTGATAT AGAAATCATA AAATTATTAT CCAAAAAACA 1100 

ACTTTTATAT AAAAAACAAA AATATGAGCA TAATTATCCT CATTGTTGGA 1150 

GATGTGGTAA TCCTTTGATA TATTATGCGA TGGAAGGTTG GTTTATTAAA 1200 

ACAACTAATT TTAAGAATGA AATTATTAAC AATAATAATA ATATAGAGTG 1250 

GTTTCCTTCT CATATTAAGG AAGGGAGAAT GGGAAATTTC TTAGAAAATA 1300 

TGGTTGATTG GAACATTGGT AGAAATAGAT ATTGGGGAAC ACCATTAAAT 1350 

GTATGGATTT GCAATGATTG TAATCACGAA TACGCACCAA GTAGTATTAA 1400 

GGATTTACAA AATAATTCCA TCAATAAAAT TGATGAAGAT ATTGAGTTGC 1450 

ATAGACCTTA TGTTGATAAT ATCACTCTTA GTTGCCCTAA GTGTAATGGG 1500 
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AAAATGTCTC GAGTAGAAGA AGTAATCGAT 
TATGCCGTTT GCTCAGCATC ATTATCCTTT 
ATCAACACTT TCCAGCTGAT TTTATTGCAG 
GGCTGGTTTT ACAGTTTACT AGTAATTTCT 
TTCTTATAAA CGTGCTTTAT CTTTAGGACA 
AAAAAATGTC TAAAAGTAAA GGAAACGTTA 
AATAAGTACG GAGCCGATTC TTTAAGATGG 
TCCATGGAAT AACAAAAGAT TCTCAGAAAA 
CGAAATTTAT AGATACGCTT GATAATATTT 
AATAAAATAG ATCACTATAA TCCTAATAAT 
TACATTAGAT AATTGGGCTC TTTCTCGCTT 
GTAATATTTA TGTAAATAAT TACGATTTCA 
AACGAATATA CCAATACAAT AAGTAATTGG 
ACGATTTTGG GAACAAGGAA TTTCTAACGA 
CGCTTTATGA AATTTTAACA ACTTTATCAA 
CCATTTATAT CTGAAAAAAT CCATTATAAT 
TTTACAAGAT TATCCACAAT ATAAAGAAAG 
AAGATGAAAT GCATACCGTT ATAAAAATTG 
CGCAAAAATG CAGATTTAAA AATTAAGCAA 
TAAACCTAAT AGTCAATTAA ACTTAAGTTT 
TAATAAAAGA CGAATTAAAT ATAAAAAACA 
AATGACTATA TTACCTATGA GCTTAAATTG 
AAAACTAGGG AACAAAACGA AAAATATTCA 
CAGAGTATGA TAAAAAAAGT TTAATTGAGT 
TCTTCTGATG CTGAGTTAAC TAAGGATGAT 
TAAGGATAGT TATCAACTCA GTGAAGATAA 
ATAAAAATTT ATCTCCTGAA TTAATTCGCG 
ATTAGATTAA TTCAACAATT AAGAAAAAAG 
ACGTATTGAT ATTTATATCG GTGTAACTGG 
AAACCAATAA AAATATGTTT AAAGAAAATT 
TTAAATGTTA TAGATGAATA TGAAAATACT 
AATAAAAATT TCCTTATTAT ATTAA 



GTTTGGTTTG ATAGCGGCTC 1550 

TGATAACCAG AAAATTTTTA 1600 

AAGGAGTTGA TCAAACGAGA 1650 

ACTATTCTAA AAGGAAAATC 1700 

TATTCTAGAC AGTAATGGTA 1750 

TTAATCCAAC TGAATTAATT 18 00 

GCCTTAATTT CGGATAGTGC 1850 

TATAGTAGCT CAGACCAAAT 1900 

ATAAATTTTA TAATATGTAT 1950 

GAAATTACAA AAAGTAGAAA 2 00 0 

AAACACCTTA ATAAAAGAAA 2 050 

CTTCCGCAGC CAGATTAATT 2100 

TATATCGGAG ATTCGAGAGG 2150 

TAAAAAAGAT GCGTACAATA 220 0 

GACTAGTGGC TCCATTTGTT 225 0 

TTGACTGGAA AAAGTGTGCA 230 0 

TTTTATTAAT CAAGCATTGG 2350 

TAGAATTATC TAGACAGGCT 24 00 

CCTTTATCGA AAATGGTGAT 2450 

TTTACCTAAT TACTATTCAA 2 500 

TTGAATTAAC TGATAATATT 2 550 

AATTTTTCTT CTGTGGGACC 2600 

AACATTGATA GACTCCCTAT 2 6 50 

CTAATAACTT CAAAAGTTTA 2 70 0 

TTTATAATTA AAACCTTACC 2 750 

TGACTGCGTT ATATTATTAG 2 80 0 

AAGGACATGC TAGAGAGCTC 2 850 

AAAAATTTAC CAATAAATCA 2 90 0 

GGAATTATTA GAATCAATAA 2 950 

TCGTGATTAA AAATATACAC 3 000 

ATTCATTTTA ATAATAAAGA 3 050 

3075 



2) INFORMATION FOR SEQ ID NO: 2145 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2145 
GCAAGATGTG GCGTGTTACG GT 



2) INFORMATION FOR SEQ ID NO: 2146 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2146 
GGGGCGAAGA AGTTGTCCAT ATT 



2) INFORMATION FOR SEQ ID NO: 2147 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: V00622 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2147 



2) INFORMATION FOR SEQ ID NO: 2148 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2148 
CAGATTAAAT GCGGATTCAG CC 




CGGGGCGTAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
660 
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2) INFORMATION FOR SEQ ID NO: 2149 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2149 
AT CAGGT AAA TCATCAGCGG ATA 



2) INFORMATION FOR SEQ ID NO: 2150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: K12 

(C) ACCESSION NUMBER: X537 96 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2150 



50 
100 
150 



ATGAATTTTA CCCGGATTGA CCTGAATACC TGGAATCGCA GGGAACACTT 
TGCCCTTTAT CGTCAGCAGA TTAAATGCGG ATTCAGCCTG ACCACCAAAC 
TCGATATTAC CGCTTTGCGT ACCGCACTGG CGGAGACAGG TTATAAGTTT 

TATCCGCTGA TGATTTACCT GATCTCCCGG GCTGTTAATC AGTTTCCGGA 200 

GTTCCGGATG GCACTGAAAG ACAATGAACT TATTTACTGG GACCAGTCAG 2 50 

ACCCGGTCTT TACTGTCTTT CATAAAGAAA CCGAAACATT CTCTGCACTG 3 00 

TCCTGCCGTT ATTTTCCGGA TCTCAGTGAG TTTATGGCAG GTTATAATGC 3 50 

GGTAACGGCA GAATATCAGC ATGATACCAG ATTGTTTCCG CAGGGAAATT 400 

TACCGGAGAA TCACCTGAAT ATATCATCAT TACCGTGGGT GAGTTTTGAC 450 

GGAT??AACC TGAACATCAC CGGAAATGAT GATTATTTTG CCCCGGTTTT 500 

?ACgItGGCA AAGTTTCAGC AGGAAGGTGA CCGCGTATTA TTACCTGTTT 550 

CTGTACAGGT TCATCATGCA GTATGTGATG GCTTTCATGC AGCACGGTTT 600 

ATTAATACAC TTCAGCTGAT GTGTGATAAC ATACTGAAAT AA 642 



2) INFORMATION FOR SEQ ID NO: 2151 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2151 
ATATTTCAGC ATTACCTTGG GTT 

2) INFORMATION FOR SEQ ID NO: 2152 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear ' 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2152 
TACACAACTC TTGTAGCCGA TTA 



PCT/CA00/01150 



23 



2) INFORMATION FOR SEQ ID NO: 2153 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexineri 
(C) ACCESSION NUMBER: X07848 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2153 

ATGAACTATA CAAAATTTGA TGTAAAAAAT TGGGTTCGCC GTGAGCATTT 50 

TGAGTTTTAT CGGCATCGTT TACCATGTGG TTTTAGCTTA ACAAGCAAAA 100 

TTGATATCAC GACGTTAAAA AAGTCATTGG ATGATTCAGC GTATAAGTTT 150 

TATCCGGTAA TGATCTATCT GATTGCTCAG GCCGTGAATC AATTTGATGA 2 00 

GTTGAGAATG GCGATAAAAG ATGATGAATT GATCGTATGG GATTCAGTCG 250 

ACCCACAATT CACCGTATTC CATCAAGAAA CAGAGACATT TTCAGCACTG 3 00 

AGTTGCCCAT ACTCATCCGA TATTGATCAA TTTATGGTGA ATTATTTATC 350 

GGTAATGGAA CGTTATAAAA GTGATACCAA GTTATTTCCT CAAGGGGTAA 400 

CACCAGAAAA TCATTTAAAT ATTTCAGCAT TACCTTGGGT TAATTTTGAT 450 

AGCTTTAATT TAAATGTTGC TAATTTTACC GATTATTTTG CACCCATTAT 500 

AACAATGGCA AAATATCAGC AAGAAGGGGA TAGACTGTTA TTGCCGCTCT 550 

CAGTACAGGT TCATCATGCA GTTTGTGATG GCTTCCATGT TGCACGCTTT 600 

ATTAATCGGC TACAAGAGTT GTGTAACAGT AAATTAAAAT AA 642 
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2) INFORMATION FOR SEQ ID NO: 2154 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2154 
CGCCATTCAG AGTTTAGGAC 



2) INFORMATION FOR SEQ ID NO: 2155 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2155 
TTCCATACCG TTGCGTATCA CTT 



2) INFORMATION FOR SEQ ID NO: 2156 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: CP590 

(C) ACCESSION NUMBER: M74769 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2156 

ATGGTATTTG AAAAAATTGA TAAAAATAGT TGGAACAGAA AAGAGTATTT 
TGACCACTAC TTTGCAAGTG TACCTTGTAC ATACAGCATG ACCGTTAAAG 
TGGATATCAC ACAAATAAAG GAAAAGGGAA TGAAACTATA TCCTGCAATG 
CTTTATTATA TTGCAATGAT TGTAAACCGC CATTCAGAGT TTAGGACGGC 
AATCAATCAA GATGGTGAAT TGGGGATATA TGATGAGATG ATACCAAGCT 
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ATACAATATT TCACAATGAT ACTGAAACAT 
TGTAAGTCTG ACTTTAAATC ATTTTTAGCA 
ACGGTATGGA AACAATCATA GAATGGAAGG 
ACATTTTTAA TGTATCTATG ATACCGTGGT 
CTGAATTTGC AGAAAGGATA TGATTATTTG 
GAAATATTAT AAAGAAGATA ACAAAATTAT 
TTCATCACGC AGTATGTGAC GGATTTCACA 
TTGCAGGAAT TGATAAATAG TTAA 
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TTTCCAGCCT TTGGACTGAG 30 0 

GATTATGAAA GTGATACGCA 3 50 

AAAGCCAAAT GCTCCGGAAA 4 00 

CAACCTTCGA TGGCTTTAAT 450 

ATTCCTATTT TTACTATGGG 50 0 

ACTTCCTTTG GCAATTCAAG 550 

TTTGCCGTTT TGTAAACGAA 600 

624 



2) INFORMATION FOR SEQ ID NO: 2157 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2157 
CCACAGAAAT TGATATTAGT GTTTTAT 



2) INFORMATION FOR SEQ ID NO: 2158 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
<C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2158 
TCGCTATTGT AACCAGTTCT A 



2) INFORMATION FOR SEQ ID NO: 2159 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: V01277 
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ATGAACTTTA ATAAAATTGA TTTAGACAAT 
TAATCATTAT TTGAACCAAC AAACGACTTT 
ATATTAGTGT TTTATACCGA AACATAAAAC 
CCTGCATTTA TTTTCTTAGT GACAAGGGTG 
TAGAACTGGT TACAATAGCG ACGGAGAGTT 
AGCCACTTTA TACAATTTTT GATGGTGTAT 
TGGACTCCTG TAAAGAATGA CTTCAAAGAG 
TGATGTAGAG AAATATAATG GTTCGGGGAA 
TACCTGAAAA TGCTTTTTCT CTTTCTATTA 
GGGTTTAACT TAAATATCAA TAATAATAGT 
TACAGCAGGA AAATTCATTA ATAAAGGTAA 
CTTTACAGGT ACATCATTCT GTTTGTGATG 
ATGAACTCTA TTCAGGAATT GTCAGATAGG 
A 



TGGAAGAGAA AAGAGATATT 50 

TAGTATAACC ACAGAAATTG 100 

AAGAAGGATA TAAATTTTAC 150 

ATAAACTCAA ATACAGCTTT 200 

AGGTTATTGG GATAAGTTAG 250 

CTAAAACATT CTCTGGTATT 3 00 

TTTTATGATT TATACCTTTC 3 50 

ATTGTTTCCC AAAACACCTA 400 

TTCCATGGAC TTCATTTACT 4 50 

AATTACCTTC TACCCATTAT 500 

TTCAATATAT TTACCGCTAT 550 

GTTATCATGC AGGATTGTTT 600 

CCTAATGACT GGCTTTTATA 650 

651 



2) INFORMATION FOR SEQ ID NO: 2160 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2160 
TTTTGAACAC TATTTTAACC AGC 



2) INFORMATION FOR SEQ ID NO: 2161 

(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2161 
GATTTAACTT ATCCCAATAA CCT 



2) INFORMATION FOR SEQ ID NO: 2162 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
<C) ACCESSION NUMBER: X02166 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2162 

ATGACTTTTA ATATTATCAA ATTAGAAAAT TGGGATAGAA AAGAATATTT 50 

TGAACACTAT TTTAACCAGC AAACTACGTA TAGCATTACT AAAGAAATTG 100 

ATATTACTTT GTTTAAAGAT ATGATAAAAA AGAAAGGATA TGAAATTTAT 150 

CCTTCTTTGA TTTATGCAAT TATGGAAGTT GTAAATAAAA ATAAAGTGTT 200 

TAGAACAGGA ATTAATAGTG AGAATAAATT AGGTTATTGG GATAAGTTAA 250 

ATCCTTTGTA TACAGTTTTT AATAAGCAAA CTGAAAAATT TACTAACATT 300 

TGGACTGAAT CTGATAACAA CTTCACTTCT TTTTATAATA ATTATAAAAA 3 50 

TGACTTGCTT GAATATAAAG ATAAAGAAGA AATGTTTCCT AAAAAACCGA 4 00 

TACCTGAAAA CACCATACCG ATTTCAATGA TTCCTTGGAT TGATTTTAGT 4 50 

TCATTTAATT TAAACATTGG TAACAATAGC AACTTTTTAT TGCCTATTAT 500 

TACGATAGGT AAATTTTATA GTGAGAATAA TAAAATTTAT ATACCAGTTG 550 

CTTTGCAGCT TCATCATGCT GTATGTGATG GTTACCATGC TTCATTATTT 60 0 

ATGAATGAAT TTCAAGATAT AATTCATAAG GTAGATGATT GGATTTAG 64 8 

2) INFORMATION FOR SEQ ID NO: 2163 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2163 
ACCTTCATCC TACCGATGTG GGTT 



2) INFORMATION FOR SEQ ID NO: 2164 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2164 
CAACGACACC AGCACTGCCA TTG 



2) INFORMATION FOR SEQ ID NO: 2165 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1215 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: H3380 

(C) ACCESSION NUMBER: AF071555 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2165 

ATGACCACCA CACGCCCCGC GTGGGCCTAT ACGCTGCCGG CAGCACTGCT 50 

GCTGATGGCT CCTTTCGACA TCCTCGCTTC ACTGGCGATG GATATTTATC 100 

TCCCTGTCGT TCCAGCGATG CCCGGCATCC TGAACACGAC GCCCGCTATG 150 

ATCCAACTCA CGTTGAGCCT CTATATGGTG ATGCTCGGCG TGGGCCAAGT 2 00 

GATTTTTGGT CCGCTCTCAG ACAGAATCGG GCGACGGCCA ATTCTACTTG 2 50 

CGGGCGCAAC GGCTTTCGTC ATTGCGTCTC TGGGAGCAGC TTGGTCTTCA 3 00 

ACTGCACCGG CCTTTGTCGC TTTCCGTCTA CTTCAAGCAG TGGGCGCGTC 3 50 

GGCCATGCTG GTGGCGACGT TCGCGACGGT TCGCGACGTT TATGCCAACC 4 00 

GTCCTGAGGG TGTCGTCATC TACGGCCTTT TCAGTTCGAT GCTGGCGTTC 450 

GTGCCTGCGC TCGGCCCTAT CGCCGGAGCA TTGATCGGCG AGTTCTTGGG 500 

ATGGCAGGCG ATATTCATTA CTTTGGCTAT ACTGGCGATG CTCGCACTCC 550 

TAAATGCGGG TTTCAGGTGG CACGAAACCC GCCCTCTGGA TCAAGTCAAG 6 00 

ACGCGCCGAT CTGTCTTGCC GATCTTCGCG AGTCCGGCTT TTTGGGTTTA 650 

CACTGTCGGC TTTAGCGCCG GTATGGGCAC CTTCTTCGTC TTCTTCTCGA 70 0 

CGGCTCCCCG TGTGCTCATA GGCCAAGCGG AATATTCCGA GATCGGATTC 75 0 

AGCTTTGCCT TCGCCACTGT CGCGCTTGTA ATGATCGTGA CAACCCGTTT 800 

CGCGAAGTCC TTTGTCGTCA GATGGGGCAT CGCAGGATGC GTGGCGCGTG 8 50 

GGATGGCGTT GCTTGTTTGC GGAGCGGTCC TGTTGGGGAT CGGCGAACTT 900 

TACGGCTCGC CGTCATTCCT CACCTTCATC CTACCGATGT GGGTTGTCGC 950 

GGTCGGTATT GTCTTCACGG TGTCCGTTAC CGCGAACGGC GCTTTGGCAG 10 00 

AGTTCGACGA CATCGCGGGA TCAGCGGTCG CGTTCTACTT CTGCATCCAA 10 50 

AGCCTGATAG TCAGTATCGT CGGGACATTG GCGGTGACGC TGTTAAACGG 1100 

CGATACAGCG TGGCCCGTGA TTTGTTACGC CACGGCAATG GCAGTGCTGG 1150 

TGTCGTTGGG GCTGGCGCTC CTTCGATCCC GTGATGCTGC CACCGAGAAG 12 00 

TCGCCAGTCG TCTAG 1215 



2) INFORMATION FOR SEQ ID NO: 2166 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2166 
GACAAACCAT TCCTGCTG 
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2) INFORMATION FOR SEQ ID NO: 2167 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2167 
CAGCAGCTGG GCGGCGGT 



2) INFORMATION FOR SEQ ID NO: 2168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2168 
CATCAAAGTT GGTGAAGAAG TTG 



2) INFORMATION FOR SEQ ID NO: 2169 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2169 
CCCGTTTGCG AAAGGTGG 



2) INFORMATION FOR SEQ ID NO: 2170 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2170 
ACGTGACGTT GACAAACCA 



2) INFORMATION FOR SEQ ID NO: 2171 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2171 
TCGTTGGATT AACTGAAGAA 
2) INFORMATION FOR SEQ ID NO: 2172 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2172 
GTGTTGAAAT GTTCCGTAAA 



2) INFORMATION FOR SEQ ID NO: 2173 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2173 
AAGAAAAAAT CTTCGAACTG GCTA 



2) INFORMATION FOR SEQ ID NO: 2174 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2174 
TCTACACGGC CGGTG 



2) INFORMATION FOR SEQ ID NO: 2175 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2175 
CCGCCATACC CCGTTT 



2) INFORMATION FOR SEQ ID NO: 2176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2176 
CGGCATTACC ATTTCCACAC CTTT 



2) INFORMATION FOR SEQ ID NO: 2177 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2177 

GGCACGGACA AACCATTCCT GCTGCCTAHC GAAGACGTGT TCCCGTGCC 



2) INFORMATION FOR SEQ ID NO: 2178 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 2178 
GGCACGACAA ACCATTCCTG CTGCCTATCG AACGTGCC 



2) INFORMATION FOR SEQ ID NO: 2179 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2179 
GGCAGCTCTA CTTCCGTACC AC T G AC GT AA CCGGCTGCC 



2) INFORMATION FOR SEQ ID NO: 2180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2180 
TTCGCCGGCG TGGGC 



2) INFORMATION FOR SEQ ID NO: 2181 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2181 
AGCGCCACGC GCAGG 15 



2) INFORMATION FOR SEQ ID NO: 2182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2182 
GCGCGCCAAC GACTTCTACC ACGAAATGGA AGAGTCGCGC GC 42 
2) INFORMATION FOR SEQ ID NO: 2183 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis subsp. faecalis 

(B) STRAIN: ATCC 8750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2183 

TATCTTGGTT WGCTCGGCCG CTGACGGCCC AATGCCTCAG ACTCGCGAGC 
ACATCCTGCT GAGCCGTCAG GTTGGCGTTC CTTACATCAT CGTGTTCCTG 
AACAAGGCCG ACATGGTTGA TGACGAAGAG CTGATCGAAC TGGTTGAAAT 
GGAAGTTCGC GAGCTGTTGT CCAAGTACGA CTTCCCTGGC GACGACACCC 
CGATCATCAA GGGTTCGGCC AAACTGGCTC TGGAAGGCGA CGAAGGCCCA 
CTGGGCAGCC AAGCCGTTCT GGCTCTGGCC GAAGCGCTGG ACAACTACAT 
TCCTACGCCT GAGCGTGCCG TTGACGGTAC GTTCCTGATG CCTGTTGAAG 
ACGTGTTCTC GATCTCCGGC CGTGGTACGG TTGTGACCGG TCGTATTGAG 
CGCGGCATCA TCAAGGTCGG CGAAGAAATC GAAATCGTGG GTATCAAAGA 
CACGGTCAAG ACCATTTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 
ACCAGGGCGA AGCTGGCGAT AACGTCGGTC TGCTGCTGCG TGGTACCAAG 
CGTGAAGACG TGGAACGTGG TCAAGTTCTG GCCAAGCCAG GCTCGATCAA 
GCCACACACT GACTTCGACG CCGAGGTGTA CATTCTGTCC AAAGAAGAAG 
GTGGTCGTCA CACTCCTTTC TTCAAGGGCT ACCGTCCTCA GTTCTACTTC 
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PCT/CA00/01150 

ACAAGGAAAT 750 
ATCGCTCCTA 800 

817 



2) INFORMATION FOR SEQ ID NO: 2184 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1652 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter coli 

(B) STRAIN: ATCC 4347 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2184 

TACAATTGAA GTTGAAAGAT CTATGCGTGT TCTTGATGGT GCTGTTGCGG 50 

TATTTTGTTC AGTGGGTGGG GTGCAGCCTC AAAGTGAAAC AGTTTGGAGA 100 

CAAGCAAATA AATATGGTGT TCCAAGAATA GTATTTGTAA ATAAAATGGA 150 

TAGAATCGGT GCAAATTTCT ACAATGTAGA AGATCAAATT CGCAACCGTT 2 00 

TAAAAGCTAA TCCAGTTCCA CTTCAAATTC CAATCGGTGC TGAGGATAAT 250 

TTTAAAGGCG TAATCGATCT TGTAACTATG AAAGCTTTAG TTTGGGAAGA 3 00 

TGATACTAAG CCAACGGATT ATGTAGAAAA AGAAATTCCA GCTGAACTTA 3 50 

AAGAAAAGGC AGAAGAATAT CGCACAAAAA TGATAGAAGC AGTTTCTGAA 400 

ACTTCAGATG AGTTGATGGA AAAATATTTA GGTGGAGAAG AATTAAGCCT 450 

TGAAGAGATT AAAACAGGGA TTAAAGCAGG ATGTTTAAGT CTTTCTATCG 500 

TTCCTATGCT TTGCGGTACA GCGTTTAAAA ATAAAGGGGT TCAACCTTTG 550 

CTTGATGCTG TTGTGGCTTA TTTACCAGCT CCTGATGAAG TTGCTAATAT 600 

CAAAGGGGAA TATGAAGACG GCACAGAAGT TTCTGTAAAA TCAACTGATG 650 

ATGGCGAATT TGCAGGACTT GCATTTAAAA TTATGACAGA TCCATTTGTA 700 

GGACAACTTA CTTTCGTGCG TGTTTATCGT GGATGTTTAG AAAGCGGTTC 750 

TTATGCTTAT AACTCAACCA AAGATAAAAA AGAAAGAATT GGTCGTTTGT 800 

TAAAAATGCA CTCTAATAAA AGAGAAGAAA TTAAAGTTCT TTACGCAGGA 850 

GAAATCGGTG CAGTTGTAGG ACTTAAAGAT ACTTTAACAG GGGATACTCT 900 

TGCAAGTGAA AAAGATAAAG TAATTCTTGA AAGAATGGAT TTCCCAGACC 950 

CAGTTATTTC TGTTGCAGTA GAMCCAAAAA CTAAAGCAGA TCAAGAAAAA 1000 

ATGTCTATTG CACTAAATAA ATTAGCTCAA GAAGATCCAA GCTTTAGAGT 1050 

TTCTACGGAT GAAGAAAGTG GTCAAACTAT CATTTCAGGT ATGGGTGAAT 1100 

TACACCTTGA AATTATTGTT GATCGTATGC TTCGTGAATT TAAAGTGGAA 1150 

GCTGAAGTGG GACAACCTCA AGTTGCTTAT CGTGAAACTA TCAGAAAAAC 1200 

TGTTGAGCAA GAATACAAAT ACGCTAAGCA ATCAGGCGGT CGTGGTCAGT 1250 

ATGGACATGT ATTCTTACGC CTTGAGCCAC TTGAGCCAGG AAGTGGATAC 13 00 

GAGTTTGTTA ACGACATCAA AGGTGGGGTA ATTCCAAAAG AATATATTCC 1350 

TGCAGTAGAT AAGGGTGTTC AAGAAGCATT GCAAAATGGT GTTTTAGCAG 1400 

GTTATCCAGT AGAAGATGTT AAAGTAACTG TTTATGATGG AAGTTATCAC 1450 

GAGGTGGATT CATCTGAGAT GGCATTTAAA CTTGCTGCTT CTATGGGATT 1500 

TAAAGAGGGT GCTAGAAAAG CAGGTGCTGT GATCTTAGAG CCTATGATGA 1550 

AAGTTGAAGT AGAAACTCCT GAAGATTACA TGGGCGATGT TATTGGCGAT 1600 

CTTAATAAGC GTCGTGGTCA AGTAAATAGC ATGGATGAAA GAGGTGGTAA 1650 

TA 1652 
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2) INFORMATION FOR SEQ ID NO: 2185 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2185 

GCTATTCTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AGACCCGTGA 50 

GCACATCCTA TTAGCACGTC AGGTAGGCGT ACCATACATC ATCGTATTCC 100 

TAAACAAGTG CGATATGGTT GACGACGAGG AATTATTAGA GTTAGTTGAG 150 

ATGGACGTAC GTGATCTATT AAATCAGTAC CAGTTCCCAG GCGACGACAC 2 00 

TCCAATCATC CGTGGTTCAG CACTAGGTGC ATTAAACGGC GAAGAGAAGT 250 

GGAAAGAGGC AATCTATCAG TTAGCAGACA CTCTAGATTC ATACATTCCA 300 

GAGCCAAAGC GTGATATCGA TGATCCATTC CTATTACCAA TCGAAGATAT 3 50 

CTTCTCAATC TCAGGTCGTG GTACTGTAGT AACCGGCCGT GTAGAGCGTG 40 0 

GTATTGTACA CGTAGGTGAC GAAGTTGAAA TCGTTGGTAT TCGTCCAACC 450 

ACCAAGACCA CTGTAACTGG CGTTGAAATG TTCCGTAAGT TACTAGACGA 500 

AGGTCGTGCA GGTGATAACG TTGGTGTTCT ACTACGTGGT ACCAAGCGTG 550 

ATGAGGTTGA GCGTGGTCAG GTTCTAGCTG CTCCAGGCAC AATCACTCCA 600 

CACACCAAGT TCACTGGTCA GGTTTACGTA CTAAGCAAGG ATGAAGGTGG 650 

TCGTCACACT CCATTCTTCA AGGGCTACCG TCCACAGTTC TTCTTCCGTA 700 

CAACCGATAT TACCGGTTCT ATCGATCTGA AAGAGGGCGT AGAGATGGTA 750 

ATGCCAGGTG ATAACACCGA CATGACCGTA ACCCTAATCC ACCCAGTAGC 8 00 

TATGGCTGAA GGCGAGAGAT 82 0 



2) INFORMATION FOR SEQ ID NO: 2186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2186 
ACGCGCTCAA AGCAGAAGTA TACGTATTAT CAAAAGACGC GCGT 



2) INFORMATION FOR SEQ ID NO: 2187 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1612 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2187 

CAGTGGGTGG GGTGCAGCCT CAAAGTGAAA CAGTTTGGAG ACAAGCAAAT ' 50 

AAATATGGTG TTCCAAGAAT AGTATTTGTA AACAAAATGG ATAGAATCGG 100 

TGCAAATTTC TACAATGTAG AAGATCAAAT TCGCAACCGT TTAAAAGCTA 150 

ATCCAGTTCC ACTTCAAATT CCAATCGGTG CTGAGGATAA TTTTAAAGGC 200 

GTAATCGATC TTGTAACTAT GAAAGCTTTA GTTTGGGAAG ATGATACTAA 250 

GCCAACGGAT TATGTAGAAA AAG AAATT C C AGCTGAACTT AAAGAAAAGG 3 00 

CAGAAGAATA TCGCACAAAA ATGATAGAAG CAGTTTCTGA AACTTCAGAT 350 

GAGTTGATGG AAAAATATTT AGGCGGAGAA GAATTAAGCC TTGAAGAGAT 400 

TAAAACAGGG ATTAAAGCAG GATGTTTAAG TCTTTCTATC GTTCCTATGC 450 

TTTGCGGTAC AGCGTTTAAA AATAAAGGGG TTCAACCTTT ACTTGATGCT 500 

GTTGTGGCTT ATTTACCAGC TCCTGATGAA GTGGCAAATA TCAAGGGTGA 550 

ATATGAAGAT GGCACAGAAG TTTCTGTAAA ATCAACTGAT GATGGCGAGT 600 

TTGCAGGACT TGCATTTAAA ATTATGACAG ATCCATTTGT AGGACAACTT 650 

ACTTTCGTGC GTGTTTATCG CGGTTGTTTA GAAAGTGGTT CTTATGCTTA 700 

TAACTCAACT AAAGATAAAA AAGAAAGAAT TGGTCGTTTG TTAAAAATGC 750 

ACTCTAACCA AAGAGAAGAG ATTAAAGTGC TTTACGCAGG CGAAATTGGT 800 

GCTGTTGTAG GACTTAAAGA TACTTTAACA GGGGATACTC TTGTAAGTGA 850 

AAAAGATAAG GTAATCCTTG AAAGAATGGA TTTTCCAGAT CCAGTTATTT 900 

CTGTTGCAGT TGAGCCAAAA ACTAAAGCAG ATCAAGAAAA AATGTCTATT 950 

GCTTTAAATA AATTAGCACA AGAAGATCCA AGTTTTAGAG TTTCTACAGA 1000 

TGAAGAAAGT GGCCAAACTA TCATTTCAGG TATGGGTGAG TTACACCTTG 1050 

AAATTATCGT TGATAGAATG CTTCGTGAAT TTAAAGTTGA AGCTGAAGTA 1100 

GGTCAACCAC AAGTTGCTTA TCGCGAAACT ATTAGAAAAA CTGTTGAACA 1150 

AGAATACAAA TACGCTAAAC AATCAGGTGG TCGTGGTCAG TATGGACATG 1200 

TATTCTTACG CCTTGAACCA CTTGAGCCAG GTAGTGGATA TGAATTTGTT 1250 

AATGATATCA AAGGTGGAGT AATTCCAAAA GAATACATTC CTGCAGTTGA 13 00 

TAAAGGTGTT CAAGAAGCAT TACAAAATGG TGTTTTAGCA GGTTATCCTG 1350 

TGGAAGATGT TAAAGTAACT GTTTATGATG GAAGTTATCA CGAGGTGGAT 1400 

TCATCTGAGA TGGCGTTTAA ACTTGCTGCT TCTATGGGCT TTAAAGAAGG 1450 

TGCTAGAAAA GCAGGTGCTG TGATCTTAGA GCCTATGATG AAAGTTGAAG 1500 

TAGAAACTCC TGAAGATTAC ATGGGTGATG TTATTGGAGA TCTTAACAAA 1550 

CGCCGTGGTC AAGTAAATAG CATGGATGAG CGTGGTGGAA ATAAAATCAT 1600 

CACAGCATTT TG 1612 



2) INFORMATION FOR SEQ ID NO: 2188 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1667 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33560 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2188 

ACAATTGAAG TTGAAAGATC TATGCGTGTT CTTGATGGTG CTGTTGCGGT 50 

ATTTTGTTCA GTGGGTGGGG TGCAGCCTCA AAGTGAAACA GTTTGGAGAC 100 

AAGCAAATAA ATATGGTGTT CCAAGAATAG TATTTGTAAA TAAAATGGAT 150 

AGAATCGGTG CAAATTTCTA CAATGTAGAA GATCAAATTC GCAACCGTTT 200 

AAAAGCTAAT CCAGTTCCAC TTCAAATTCC AATTGGTGCT GAGGATAATT 250 

TTAAAGG CGT AATCGATCTT GTAACTATGA AAGCTTTAGT TTGGGAAGAT 3 00 

GATACTAAGC CAACGGATTA TGTAGAAAAA GAAATTCCAG CTGAACTTAA 350 

AGAAAAGGCA GAAGAATATC GCACAAAAAT GATAGAAGCA GTTTCTGAAA 400 

CTTCAGATGA GTTGATGGAA AAATATTTAG GCGGAGAAGA ATTAAGCCTT 450 

GAAGAGATTA AAACAGGGAT .TAAAGCAGGA TGTTTAAGTC TTTCTATCGT 50 0 

TCCTATGCTT TGCGGTACAG CGTTTAAAAA TAAAGGGGTT CAACCTTTGC 550 

TTGATGCTGT TGTGGCTTAT TTACCAGCTC CTGATGAAGT GGCAAATATC 6 00 

AAGGGTGAAT ATGAAGATGG CACAGAAGTT TCTGTAAAAT CAACTGATGA 650 

TGGCGAGTTT GCAGGACTTG CATTTAAAAT CATGACAGAT CCATTTGTAG 700 

GACAACTTAC TTTCGTGCGT GTTTATCGCG GTTGTTTAGA AAGCGGTTCT 750 

TATGCGTATA ACTCAACTAA AGATAAAAAA GAAAGAATTG GTCGTTTGTT 8 00 

AAAAATGCAC TCTAACAAAA GAGAAGAGAT TAAAGTGCTT TACGCAGGCG 850 

AAATTGGTGC TGTTGTAGGA CTTAAAGATA CTTTAACAGG GGATACTCTT 900 

GCAAGTGAAA AAGATAAGGT AATCCTTGAA AGAATGGATT TTCCAGATCC 950 

AGTTATTTCT GTTGCAGTTG AGCCAAAAAC TAAAGCTGAT CAAGAAAAAA 100 0 

TGTCTATTGC TTTAAATAAA TTAGCACAAG AAGATC CAAG TTTTAGAGTT 1050 

TCTACAGATG AAGAAAGTGG TCAAACTATC ATTTCAGGTA TGGGTGAGTT 1100 

ACACCTTGAA ATTATCGTTG ATAGAATGCT TCGTGAATTT AAAGTTGAAG 1150 

CTGAAGTAGG TCAACCACAA GTTGCTTATC GCGAAACTAT TAGAAAAACT 1200 

GTTGAACAAG AATACAAATA CGCTAAACAA TCAGGTGGTC GTGGTCAGTA 1250 

TGGACATGTA TTCTTACGCC TTGAACCACT TGAGCCAGGT AGTGGATATG 13 00 

AATTTGTTAA TGATATCAAA GGTGGGGTAA TTCCAAAAGA ATACATTCCT 13 50 

GCAGTTGATA AAGGTGTTCA AGAAGCATTA CAAAATGGTG TTTTAGCAGG 14 00 

TTATCCTGTG GAAGATGTTA AAGTAACTGT TTATGATGGA AGTTATCACG 14 50 

AGGTGGATTC ATCTGAGATG GCGTTTAAAC TTGCTGCTTC TATGGGCTTT 150 0 

AAAGAAGGTG CTAGAAAAGC AGGCGCTGTG ATCTTAGAGC CTATGATGAA 1550 

AGTTGAAGTA GAAACTCCTG AAGATTATAT GGGTGATGTT ATTGGAGATC 16 0 0 

TTAACAAACG CCGTGGTCAA GTAAATAGCA TGGATGAGCG TGGTGGAAAT 1650 

AAAATCATCA CAGCATT 1667 



2) INFORMATION FOR SEQ ID NO: 2189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1255 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Leishmania 

(B) STRAIN: ATCC 50126 

(xi) SEQUENCE DESCRIPTION: SEQ 

TGACGGCGCT GGACGTGACG GAGGACCTCG 
CTGGAGATCG TGCAGCACTT GGATGCGAAC 
GCAGACGACG GACCTGCTGA AGCTGAAGTC 
GCAACATCTC CGTGCCGGTG GGCCGCGAGA 
GTGCTGGGCG ACGCGATTGA CCACCGCGGC 
CATGGCGATC CACGCCGAGG CGCCGAAGCT 
ACACGATCCT GACGACCGGC ATCAAGGTGA 
TGCAAGGGCG GCAAGATCGG CCTGTTCGGC 
TGTGATCATC ATGGAGCTGA TCAACAACGT 
TCTCCGTGTT CGCCGGCGTT GGCGAGCGCA 
TACCTGGAGA TGATGCAGTC AAAGGTGATT 
GTGCGTGCTT GTGTACGGGC AGATGAACGA 
GCGTTGCGCA GTCTGCGCTG ACGATGGCCG 
GGCCAGAACG TGCTGCTGTT CATCGACAAC 
GAACTCCGAG GTGTCTGCGC TGCTGGGCCG 
ACCAGCCGAC GCTTGCGGAG GATCTTGGCA 
TCGACGACGA AGGGATCGAT TACGTCTGTG 
GGATGATATC ACGGACCCCG CGCCCGCGAC 
CGACGACTGT GCTGGACCGC GCGGTGGCGG 
GTGAACCCGC TGGAGTGCGC GTCGCGCATC 
CGTGGACCAT TACAACGTTG CACAGGATAT 
ACAAGGAGCT GCAGGACATC ATTGCGGTGC 
GAGGAGGACA AGGTCGTGGT GGACCGCGCG 
GTCGCAGCCG TTCCAGGTTG CGGAGGTGTT 
ACGTGCAGCT GAGCGACACG GTGGAGTCGT 
TCGTA 



PCT/CA00/01150 

guyanensis 



ID NO: 2189 

GCCGCGATGA GCCGCTGACG 50 

ACCGGCCGCT GCATTGCGAT 100 

GAAGGTTGTG TCGACCGGCG 150 

CACTGGGCCG CATCTTCAAC 2 00 

CCCGTGTGCG AGAAGATGCG 250 

GGCGGACCAG GCTGCGGAGG 3 00 

TCGACCTGAT TCTGCCCTAC 350 

GGTGCCGGTG TGGGCAAGAC 4 00 

CGCGAAGGGG CACGGCGGCT 4 50 

CGCGCGAGGG CACGGACCTG 500 

GACCTGAAGG GCGAGTCGAA 550 

GCCCCCGGGT GCGCGCGCGC 600 

AGTACTTCCG CGACGTGGAG 650 

ATCTTCCGCT TCACGCAGGC 70 0 

CATCCCGGCC GCCGTGGGTT 750 

TGCTGCAGGA GCGCATTACG 800 

CAGGCTGTGT ACGTGCCTGC 850 

GACGTTCTCG CACCTGGACG 900 

AGTCGGGCAT CTACCCTGCC 950 

ATGGACCCCG ATGTGATCGA 1000 

CGTCCAGATG CTGACCAAGT 1050 

TTGGCATCGA CGAGCTGAGC 1100 

CGCAAGGTGA CGCGGTTCCT 1150 

CACCGGCATG ACGGGCCACT 12 00 

TCTCTGGCCT GCTGATGGGG 12 50 

1255 



2) INFORMATION FOR SEQ ID NO: 2190 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1248 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei subsp. brucei 

(B) STRAIN: EATRO 7 95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2190 

GCCCTTGACG TTGTTGACAA ACTTGGCCGT GATGAGCCTC TTACTCTTGA 5 0 

GATCGTGCAG CATCTTGACG CCCACACGGG CCGCTGTATC GCGATGCAAA 100 

CGACGGATCT CCTCAAACTG AAGGCAAAGG TCGTTTCGAC AGGTGGCAAC 150 

ATTTCCGTTC CTGTCGGCCG GGAAACACTA GGTCGTATCT TCAACGTGCT 20 0 

TGGAGACGCT ATTGACCAGC GCGGCCCCGT TGGTGAGAAA CTGCGCATGC 250 

CCATCCATGC CGTGGCTCCC AAGCTTGCGG ACCAGGCCGC TGAGGATGCG 30 0 

GTGCTCACAA CTGGTATTAA GGTGATTGAT CTCATTCTCC CTTACTGCAA 3 50 
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AGGTGGAAAA ATTGGCCTCT TTGGGGGTGC GGGTGTGGGC AAAACCGTCA 400 

TTATTATGGA GCTCATTAAC AACGTTGCCA AGGGTCACGG TGGTTTCTCT 450 

GTCTTCGCTG GTGTTGGTGA GCGTACCCGT GAGGGAACGG ATTTGTATCT 50 0 

TGAGATGATG CAGTCTAAGG TTATTGACCT TAAGGGTGAG TCCAAATGTG 550 

TGTTGGTGTA CGGTCAGATG AACGAGCCCC CAGGTGC CCG TGCGCGTGTT 6 00 

GCGCAGTCGG CTCTGACGAT GGCTGAGTAC TTCCGTGATG TGGAGGGCCA 650 

AGATGTGCTT CTTTTTATCG ACAATATTTT TCGTTTCACT CAGGCTAACT 700 

CCGAGGTGTC GGCGCTTCTG GGTCGTATTC CCGCCGCTGT TGGCTACCAG 750 

CCTACCCTCG CTGAGGATCT AGGGCAGTTG CAGGAGCGTA TTACCTCAAC 800 

AACGAAAGGC TCCATTACTT CTGTGCAGGC CGTATACGTG CCGGCCGATG 850 

ACATTACCGA TCCAGCTCCA GCAACAACCT TCTCACATCT GGACGCCACA 900 

ACTGTGTTGG ACCGTGCTGT TGCCGAGTCT GGTATCTACC CCGCTGTTAA 950 

CCCACTGGAA TGCGCCTCGC GTATCATGGA CCCCGACGTT ATCAGTGTGG 1000 

AT C ACT AC AA TGTTGCACAA GATGTGGTAC AGATGCTCAC CAAGTACAGG 1050 

GAATTACAGG ATATCATTGC TGTCCTTGGT ATCGACGAGC TAAGCGAGGA 1100 

GGACAAACTT ATCGTGGACC GTGCGCGTAA GTTGGTGAAG TTCCTCTCCC 1150 

AGCCATTCCA AGTTGCTGAG GTCTTCACAG GAATGACTGG CCATTACGTG 1200 

CAGTTGGATG ACACCATCGA TTCCTTTTCT GGTCTCCTCA TGGGTACG 1248 



2) INFORMATION FOR SEQ ID NO: 2191 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2191 

GAGCTGATTG TAAGTCATCG ACCACTTGAT ATATGAATAC ATCTAACAGT 50 

AGTAGAACAA CATCGCCAAG GCTCACGGTG GTTACTCCGT CTTCACTGGT 100 

GTCGGTGAGC GTACTCGTGA GGGTAACGAT CTGTACCACG AAATGCAGGA 150 

GACTGGTGTC ATTCAGCTCG ACGGCGAATC CAAGGTGTCT CTTGTGTTCG 2 00 

GTCAGATGAA CGAGCCCCCA GGTGCTCGTG CCCGTGTCGC CCTTACTGGT 250 

CTGACCATCG CCGAATACTT CCGTGACGAG GAGGGTCAGG ACGTGCTGCT 3 00 

CTTCATTGAC AACATTTTCC GTTTCACCCA GGCCGGTTCC GAGGTGTCTG 3 50 

CCCTTCTTGG TCGTATCCCC TCTGCCGTCG GTTACCAGCC CACTCTGGCC 4 00 

GTCGACATGG GTGGTATGCA GGAACGTATT ACCACCACCA 44 0 



2) INFORMATION FOR SEQ ID NO: 2192 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1262 bases 
-(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania panamensis 

(B) STRAIN: ATCC 50158 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2192 



GCTGACGGCG CTGGACGTGA CGGAGGACCT 
CGCTGGAGAT CGTGCAGCAC TTGGATGCGA 
ATGCAGACGA CGGACCTGCT GAAGCTGAAG 
CGGCAACATC TCCGTGCCGG TGGGCCGCGA 
ACGTGCTGGG CGACGCGATT GACCACCGCG 
CGCATGGCGA TCCACGCCGA GGCGCCGAAG 
GGACACGATC CTGACGACCG GCATCAAGGT 
ACTGCAAGGG CGGCAAGATC GGCCTGTTCG 
ACTGTGATCA TCATGGAGCT GATCAACAAC 
CTTCTCCGTG TTCGCCGGCG TTGGCGAGCG 
TGTACCTGGA GATGATGCAG TCAAAGGTGA 
AAGTGCGTGC TTGTGTACGG GCAGATGAAC 
GCGCGTTGCG CAGTCTGCGC TGACGATGGC 
AGGGCCAGAA CGTGCTGCTG TTCATCGACA 
GCGAACTCCG AGGTGTCTGC GCTGCTGGGC 
TTACCAGCCG ACGCTTGCGG AGGATCTTGG 
CGTCGACGAC GAAGGGATCG ATTACGTCTG 
GCGGATGATA TCACGGACCC CGCGCCCGCG 
CGCGACGACT GTGCTGGACC GCGCGGTGGC 
CCGTGAACCC GCTGGAGTGC GCGTCGCGCA 
GACGTGGACC ATTACAACGT TGCACAGGAT 
GTACAAGGAG CTGCAGGACA TCATTGCGGT 
GCGAGGAGGA CAAGGTCGTG GTGGACCGCG 
CTGTCGCAGC CGTTCCAGGT TGCGGAGGTG 
CTACGTGCAG CTGAGCGACA CGGTGGAGTC 
GGTCGTACGA CC 



CGGCCGCGAT GAGCCGCTGA 50 

ACACCGGCCG CTGCATTGCG 100 

TCGAAGGTTG TGTCGACCGG 150 

GACGCTGGGC CGCATCTTCA 200 

GCCCCGTGTG CGAGAAGATG 250 

CTGGCGGACC AGGCTGCGGA 3 00 

GATCGACCTG ATTCTGCCCT 350 

GCGGTGCCGG TGTGGGCAAG 4 00 

GTCGCGAAGG GGCACGGCGG 4 50 

CACGCGCGAG GGCACGGACC 50 0 

TTGACCTGAA GGGCGAGTCG 550 

GAGCCCCCGG GTGCGCGCGC 600 

CGAGTACTTC CGCGACGTGG 650 

ACATCTTCCG CTTCACGCAG 70 0 

CGCATCCCGG CCGCCGTGGG 750 

CATGCTGCAG GAGCGCATTA 8 00 

TGCAGGCTGT GTACGTGCCT 850 

ACGACGTTCT CGCACCTGGA 900 

GGAGTCGGGC ATCTACCCTG 950 

TCATGGACCC CGATGTGATC 1000 

ATCGTCCAGA TGCTGACCAA 1050 

GCTTGGCATC GACGAGCTGA 1100 

CGCGCAAGGT GACGCGGTTC 1150 

TTCACCGGCA TGACGGGCCA 12 00 

GTTCTCTGGC CTGCTGATGG 1250 

1262 



2) INFORMATION FOR SEQ ID NO: 2193 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2193 

TTCCGATGGT CAAATGTACG ATTGATATTC CTTCCAGCCA GTCAGGATAA 50 

CAGCTGATAC CAGTTGCAAA TAGGCCCCAG ACTCGTGAGC ACTTGTTGCT 100 

TGCCCGTCAG GTTGGTGTCC AGAAGATCGT TGTCTTCGTC AACAAGGTTG 150 

ACGCTGTCGA TGACCCTGAG ATGTTGGAGC TTGTTGAGCT CGAGATGCGT 2 00 
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GAGCTCCTCA ACACTTACGG TTTCGAGGGA 
CGGTTCCGCC CTGTGCGCTC TCGAAGGCCG 
AGCGTATTGA CTCCCTCCTC GAGGCCGTTG 
CAGCGTGACT TGGACAAGCC CTTCCTGATG 
CATTGGTGGT CGTGGTACCG TCGCCTCTGG 
TCAAGAAGGA TACCGAAGTT GAAATTCACG 
ACCAAGGTCA CCGACATTGA GACCTTCAAG 
TGCTGGTGAC AACTCCGGTC TTCTCCTCCG 
TTCGTCGTGG TATGGTCATC GCTGCCCCTG 
AAGTTCATGG TCTCCATGTA CGTCTTGACT 
GAACGGCTTC GGTGCCAACT ACCGCCCCCA 
GTAAGTTTCG AACTATTTGA TTCATTGATC 
AGACGAGGCT TGCGACCTTC ATTTCCCTGA 
ACGTCATGCC CGGTGACAAC GTCGAAATGG 
GTTGCTGCTG AG 
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GAGGAGACCC CTATCATCTT 250 

CCGCGAGGAC ATTGGTACTC 3 00 

ACACTTGGAT CCCTACCCCC 3 50 

TCCATTGAGG AAGTTTTCTC 4 00 

TCGTGTCGAG CGTGGTCTCC 4 50 

GTGCTGATGG TATTCTGAAG 500 

AAGAGCTGCG ATGAGTCTCG 550 

TGGTATCCGT CGTGAGGATG 600 

GCTCCATCAA GGCCTCCAAG 650 

GAGGCTGAAG GTGGCCGCAA 70 0 

GGCTTTCATC CGCACTGCTG 750 

ACGTCCCTAA CTGTTACTTT 800 

TGAGGCCGAC AAGGACCGCC 850 

TCCTCAACCT CAACAACCCC 900 

912 



2) INFORMATION FOR SEQ ID NO: 2194 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 887 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
<D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aureobasidium pullulans 

(B) STRAIN: WSA-234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2194 

TCATCGTCGT CGCCGCTTCT GACGGTCAGA TGCCCCAGAC CAGAGAGCAC 50 

TTGCTTCTCG CCCGTCAGGT CGGTATCCAG AAGATTGTCG TCTTCGTCAA 100 

CAAGGTCGAC GCCATCGAGG ACAAGGAGAT GCTTGAGCTC GTCGAGATGG 150 

AGATGCGTGA GCTCCTCAGC AGCTACGGCT TCGAGGGTGA CGAGACCCCC 2 00 

ATCGTCATGG GTTCCGCTCT TTGTGCCCTT GAGAACCGCC AGCCCGAGAT 2 50 

TGGAACCACC CAGATCGACA ACCTGATGAA CGCTGTCGAC GAGTGGATCC 3 00 

CCACTCCCCA GAGAGATCTT GAGAAGCCCT TCCTCATGTC CGTTGAGGAT 3 50 

GTCTTCTCTA TCCCCGGTCG TGGTACTGTC GTTTCTGGCC GTGTTGAGCG 400 

TGGTACCCTG AAGAAGGATT CCGAAATCGA GCTTGTCGGC AAGAACAAGG 4 50 

TCCCCATCAA GACCAAGGTC ACCGACATCG AGACCTTCAA GAAGTCTTGC 500 

GACGAGTCCC GCGCTGGTGA CAACTCCGGT CTTCTGCTCC GTGGTATCAA 550 

GCGTGAGGAT GTCAACCGTG GTATGGTTGT CGTCAAGCCC GGTACCGTCA 600 

CCTCGCACAA GAAGTTCCTC GTCTCCATGT ACGTCCTGAC CAAGGAGGAG 650 

GGTGGTCGTC ACACTGGTTT CCACGGAAAC TACCGTCCCC AGATCTTCAT 700 

CCGTACCGCT GGTAAGTCCT GACTTTGAAC TGCTGACCAA TTTTCGCATC 750 

TCTAACATGT TTTACAGACG AGGCTGCCGC TATTGACTGG CCCGAAGGCA 800 

CCGAGGACGC TGACTCCAAG ATGGTCATGC CCGGTGACAA CGTCGAGATG 850 

GTCTGCTCTC TTCACAGACC TCTTGCCGTT GAACAGG 887 



2) INFORMATION FOR SEQ ID NO: 2195 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmons ia parva 

(B) STRAIN: ATCC 10784 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2195 

TGACGGCCAA ATGTAGGACT CTTGCGCGGG ACTGACTGAT TGGGGGGAAT 50 

CCATTTTTTC TTTTTTCTTT TTCTTTTTGA GTACATGATT ATACTAATAT 100 

TTGGATAACG TACCAGGCCT CAAACACGTG AACATTTACT CCTTGCCCGA 150 

CAAGTCGGTG TCCAGAAGAT CGTCGTTTTC GTAAACAAGG TCGATGTTCT 200 

CGAAGATAAG GAGATGTTGG AGCTTGTCGA GTTGGAAATG AGAGAGCTCT 250 

TGAACACCTA CGGATTCGAG GGTGAGGAAA CGCCCATCAT CTTCGGTTCT 300 

GCCCTTTGTG CCATGGAGGG CCGCGAGCCT GAGCTGGGCG AGAAGAGAAT 350 

TGATGAATTG CTCGACGCTG TTGATAGCTG GATCCCTACG CCGCAACGTG 400 

ATACGGAAAA ACCCTTCCTA ATGTCCATTG AGGAAGTGTT CTCCATCTCC 450 

GGTCGTGGAA CCGTTGCCTC CGGCCGTGTT GAGCGTGGTG TCCTCAAGAA 500 

GGATTCCGAA GTCGAGCTTG TTGGCGGCGG CGTCGCCCCA ATCAGGACCA 550 

AGGTAACCGA TATCGAAACC TTCAAGAAGT CCTGCGACGA GTCCAGGGCT 600 

GGAGACAACT CTGGCCTTTT GTTGCGTGGT GTCAAGCGTG AGGATATCCG 650 

CCGTGGTATG GTCGTTGTCG TTCCTGGCAG CGTCAAGGCC CACGACAAGT 700 

TCTTGGTGTC CATGTATGTT CTGACCGAAG CCGAGGGTGG TCGCCGAACT 750 

GGATTCGGCC AAAACTATCG TCCTCAAATG TTTATCCGCA CAGCTGGTAC 800 

GTAACGTATA ATGCCTCTCC TCTTCATATA TACCACCCCC CCCACCACTG 850 

ACTCCCTGAC TCTTCGATTA CAGACGAGGC CGCCGATCTC AGCTTCCCTG 900 

ACGCAGCAGA CGAAACCAAA CTGGTTATGC CCGGTGACAA CGTCGAGATG 950 

ATCCTCAAGA CACACCGCCC CATAGCTGCC GAAG 984 



2) INFORMATION FOR SEQ ID NO: 2196 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exserohilum rostratum 

(B) STRAIN: WSA-215 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2196 

GCCGCCTCTG ACGGCCAAAT GCCCCAGACC CGTGAGCACT TGCTGCTTGC 50 

TCGCCAGGTC GGTGTTCAGA AGATTGTTGT TTTTGTCAAC AAGGTCGATG 100 

CCGTCGAGGA CAAGGAGATG TTGGAGCTCG TTGAGATGGA GATGCGTGAG 150 

TTGCTCAGCA GCTATGGCTT CGAGGGCGAT GAGACTCCCA TTGTCATGGG 200 
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CTCCGCTCTC TGCGCCATCG 
GTATTGATGA GCTGCTCGAG 
CGTGATACCG ACAAGCCCTT 
TGCTGGCCGT GGTACCGTCG 
AGCGCGATGC CGAAGTCGAG 
ACCAAGGTTA CCGACATTGA 
CGCCGGTGAC AACTCGGGTC 
TCCGCCGTGG CATGGTCGTT 
AAGTTCCTCG TCTCCATGTA 
CACGGGCTTC GGCGAGAACT 
ACGAGTCATG CGCGCTGCAC 
AAGCTTGTTA TGCCTGGTGA 
GCCACA 



AGGGCCGTGA 
GCTGTCGACA 
CCTCATGGCT 
TTTCTGGCCG 
CTGGTTGGCA 
AACCTTCAAG 
TCCTCCTCCG 
TCTGTTCCTG 
TGTGTTGAGC 
ACAGGCCACA 
TGGCCAGAAG 
TAACGTTGAG 



GCCCGAAATT GGTGTCAACC 
CCTGGATCCC CACCCCTCAG 
GTTGAGGACG TCTTCTCCAT 
TGTCGAGCGA GGTGTCTTGA 
AGGGCACTGC GCCCATCAAG 
AAGTCGTGTG AAGAGTCTCG 
TGGTGTCAAG CGTGACGACG 
GACAAGTCAA GGCCCACAAG 
AAAGAGGAAG GTGGCCGCCA 
AATGTTCATC CGTACTGCTG 
GTACCCCAGA TGCTCACGAC 
ATGGTTTGTG AGCTTCACGT 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
806 



2) INFORMATION FOR SEQ ID NO: 2197 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium monili forme 

(B) STRAIN: WSA-213 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2197 

ACCCGTGAAC ACTTGCTCCT CGCTCGTCAG GTTGGTGTTC AGCGAATTGT 50 

CGTCTTTGTC AACAAGGTCG ATGCCATTGA TGACCCCGAG ATGCTTGAGC 100 

TCGTCGAGAT GGAGATGCGC GAGCTTCTTA ACACCTATGG CTTCGAAGGC 150 

GACGACACTC CCGTCATCAT GGGCTCGGCT CTCATGTCTC TCCAGAACCA 2 00 

GCGCCCTGAG ATTGGCACCG AGAAGATTGA TGAGCTTCTT GCTGCCGTCG 2 50 

ACGAGTGGAT CCCAACCCCC GAGCGTGACC TTGACAAGCC CTTCCTTATG 3 00 

TCCGTCGAGG ATGTCTTCTC CATTGCTGGC CGTGGTACCG TCGTGTCTGG 3 50 

CCGTGTGGAG CGTGGTGTTC TGAAGCGTGA CCAGGAGATC GAGCTTGTTG 4 00 

GAAAGGGTCA GGAGGTTATC AAGACCAAGG TTACCGACAT CGAGACCTTC 4 50 

AAGAAGTCTT GTGAGCAGTC CCAGGCTGGT GACAACTCTG GTCTCCTCAT 5 00 

CCGAGGTGTT CGCCGTGAGG ATGTCCGCCG TGGTATGGTC GTCTGCGCTC 550 

CTGGCACCGT GAAGTCTCAC ACCCAGTTCC TCGCTTCCCT CTATGTCCTC 6 00 

TCCAAGGAGG AGGGTGGCCG ACACACCGGT TTCCAGGAGC ACTACCGACC 6 50 

CCAGCTCTAC CTCCGAACCG CAGATGAGTC CATTGACCTG ACTTTCCCTG 700 

AGGGTACTGA GGATGCCTCC AGCAAGATGG TCATGCCTGG CGACAACACC 750 

GAGATGGTTG TCACCATGGG TCACCCCAAT GCCATCGAGG TTGGTCAGC 799 



2) INFORMATION FOR SEQ ID NO: 2198 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 32793 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2198 

CTCTGACGGT CAGATGCCCC AGACCCGTGA GCACTTGCTG CTTGCCCGTC 50 

AGGTCGGTGT C CAGAAG ATT GTCGTCTTCG TCAACAAGGT CGATGCCATT 100 

GACGACCCTG AGATGCTTGA GCTCGTCGAG ATGGAGATGC GTGAGCTCCT 15 0 

CAACACCTAC GGCTTCGAGG GTGACGAGAC CCCTGTCATC ATGGGCTCTG 200 

CTCTCATGTC CCTCCAGAAC CAGCGCCCCG AGATCGGTAG CCAGAAGATT 250 

GACGAGCTCC TTGCCGCCGT TGACGAGTGG ATCCCTACCC CCGAGCGTGA 3 00 

CCTTGACAAG CCCTTCCTCA TGTCCGTTGA GGATGTCTTC TCCATTGCCG 3 50 

GCCGTGGTAC CGTCGTCTCT GGC CGTGTCG AGCGTGGTGT CCTGAAGCGC 400 

GACCAGGAGA TTGAGCTCGT CGGCAAGGGT AACGAGGTCA TCAAGACCAA 4 50 

GGTCACCGAC ATTGAGACCT TCAAGAAGTC TTGCGAGCAG TCCCAGGCTG 500 

GTGACAACTC TGGTCTCCTC ATCCGAGGTG TCCGCCGTGA GGATGTCCGC 550 

CGTGGTATGG TCGTCTGCGC CCCCGGCACT GTCAAGTCCC ACACTCAGTT 60 0 

CCTTTCTTCC CTCTACGTCC TCACCAAGGA GGAGGGTGGC CGACACACTG 650 

GCTTCCAGGA GCACTACCGA CCCCAGCTCT ACCTCCGAAC TGCTGATGAG 700 

TCCATCGACC TGACCTTCCC CGAGGGTACC GAGGACGCCA GCAGCAAGAT 750 

GGTCATGCCC GGTGACAACA CCGAGATGGT CATCACCATG GGCCACCCCA 8 00 

ACGCCATTGA GGTCGGTCA 819 



2) INFORMATION FOR SEQ ID NO: 2199 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1025 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: WSA-377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2199 

TGCTGACGGC CAAATGTAAG ACGCCGCGAG GGAGTGCTGA AGGTTTTATG 50 

CTTTTTAGGC CCTTTGTTTC TGAGAGCATG ATGATACTAA TACTCGGAAA 100 

CGTATCTATT AGGCCTCAAA CACGTGAGCA TTTGCTCCTT GCCCGACAGG 150 

TCGGTGTCCA AAAGATCGTC GTTTTCGTGA ACAAAGTCGA CGCCCTTGAG 2 00 

GACAAGGAGA TGTTGGAGCT TGTCGAGTTA GAAATGAGAG AGCTCTTAAA 250 

CACCTACGGA TTTGAGGGTG AAGAGACACC CATCATCTTT GGTTCTGCCC 300 

TTTGCGCCAT GGAAGGCCGT GAGCCTGAGT TGGGAGAAAA GAAAATTGAT 3 50 

GAATTGCTGG AGGCTGTTGA TACTTGGATC CCAACACCAC AACGTGATAC 400 

CGAAAAACCT TTCTTGATGT CCGTTGAGGA AGTATTCTCT ATCTCCGGTC 450 

GTGGAACCGT TGCCTCCGGT CGTGTTGAGC GCGGTGTCCT CAAGAAGGAT 500 
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TCAGAAGTCG AGCTAATTGG GGGCGGCTCC 
AACTGATATC GAAACTTTCA AGAAATCCTG 
ACAACTCCGG TCTTTTATTG CGTGGTATCA 
GGTATGGTAG TTGCCGTTCC TGGCAGCGTC 
GGTGTCGATG TATGTCCTGA CCGAAGCTGA 
TCGGCCAGAA CTATCGTCCT CAAATGTTCA 
AAATGGGGCC CCTTTTCATA ATCCTTTCTT 
ATCTCTCTCT CTGTTTCTTT TCAACTCGCC 
CCCCTTTGAT TATAGACGAA GCCGCCCATC 
GATGAAAGCA AACTCGTTAT GCCTGGTGAC 
GACACACCGC CCCGTGGCTG CTGAG 
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ACCCCCATCA GGACGAAGGT 550 

TGACGAGTCT AGAGCTGGGG 6 00 

AGCGTGAAGA TATCCGCCGT 650 

AAGGCCCACG ACAAGTTCTT 700 

GGGTGGTCGC CGAACCGGAT 750 

TCCGCACAGC TGGTATGTCA 80 0 

TTTTTCCTTT TCCTCTCTCT 85 0 

TGATTCACGA AATTAACTAA 900 

TCAGCTTCCC TAGTGGAGCA 950 

AACGTCGAGA TGATCCTCCA 10 0 0 

1025 



2) INFORMATION FOR SEQ ID NO: 2200 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kocuria kristinae 

(B) STRAIN: ATCC 27570 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2200 

AGCACGTGCT GCTCGCCCGC CAGGTCGGCG TGCCGACCCT GCTGGTCGCC 50 

CTGAACAAGG CCGACATGGT CGAGGACGAG GAGCTGCTGG ACCTCGTCGA 10 0 

GATGGAGGTC CGCGAGCTGC TGTCCTCCCA GGAGTTCGAC GGCGACAACG 150 

CCCCCGTCAT CCGCGTCTCC GCGCTGAAGG CGCTGGAGGG CGACGAGAAG 20 0 

TGGGTCAAGT CCATCGAGGA GCTCATGGAG GCCGTGGACG AGTACATCCC 2 50 

GGACCCCGTG CGCGACAAGG ACAAGCCGTT CCTGATGCCC ATCGAGGACG 3 00 

TCTTCACCAT CACCGGGCGC GGCACCGTGG TGACCGGTCG CGCCGAGCGC 3 50 

GGGACCCTGG CCCTGAACTC CGAGGTCGAG ATCGTCGGCA TCCGCCCGAT 4 00 

CCAGAAGACC ACGGTCACCG GGATCGAGAT GTTCCACAAG CAGCTCGACG 4 50 

AGGCCTGGGC CGGCGAGAAC TGCGGTCTGC TGCTGCGCGG CCTGAAGCGC 500 

GACGACGTCG AGCGCGGCCA GGTCGTGGTG AAGCCGGGTT CCATCACCCC 550 

GCACACCAAC TTCGAGGCGA ACGTCTACAT CCTGTCCAAG GACGAGGGTG 600 

GGCGTCACAA CCCGTTCTAC TCGAACTACC GTCCGCAGTT CTACTTCCGG 650 

ACCACCGACG TCACCGG 667 



2) INFORMATION FOR SEQ ID NO: 2201 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio mimicus 

(B) STRAIN: ATCC 33653 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2201 

TGCTGCAACA GATGGTCCAA TGCCACAAAC TCGTGAGCAC ATCCTGCTGG 50 

GTCGCCAAGT AGGTATTCCT TACATCATCG TATTCATGAA CAAATGTGAC 100 

ATGGTTGACG ATGAAGAGCT TCTAGAGCTG GTTGAGATGG AAGTTCGTGA 150 

GCTTCTGTCT GAGTACGATT TCCCAGGTGA TGACCTGCCA GTAATCCAAG 20 0 

GTTCAGCACT AGGCGCGCTA AACGGCGAAG CACAGTGGGA AGCGAAGATT 250 

GTTGAACTAG CAGAAGCACT AGATTCATAC ATTCCAGAGC CAGAGCGTGC 3 00 

AGTAGACATG GCATTCCTGA TGCCAATCGA AGACGTATTC TCAATCCAAG 350 

GTCGTGGTAC AGTAGTAACT GGCCGTATCG AGCGCGGCAT CCTGAAAGTG 400 

GGTGACGAAG TTGCGATCGT TGGTATCAAA GACACAGTAA AAACTACCTG 4 50 

TACAGGTGTA GAAATGTTCC GTAAGCTGCT TGACGAAGGT CGTGCAGGTG 500 

AGAACGTTGG TGCACTGCTA CGTGGTACTA AGCGTGAAGA AGTAGAGCGT 550 

GGTCAAGTAC TGGCGAAGCC AGGTTCAATC ACCCCACACA CTAAGTTCGA 60 0 

ATCAGAAGTA TACGTACTGT CAAAAGACGA AGGTGGCCGT CATACTCCAT 650 

TCTTCAAAGG TTACCGTCCA CAGTTCTACT TCCGTACAAC TGACGTAACA 700 

GGCAGCATCG AGCTTCCAGA AGGCGTAGAA ATGGTAATGC CAGGCGACAA 750 

CATCAAGATG GTTGTAGACC TGATTGCA 778 



2) INFORMATION FOR SEQ ID NO: 2202 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter freundii 

(B) STRAIN: ATCC 8090 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2202 

AACGCTGACC CTGCAGGTTA TTGCCGCTGC GCAGCGTGAA GGTAAAACCT 50 

GTGCATTTAT CGATGCAGAA CACGCACTGG ACCCGGTCTA TGCCCGTAAG 100 

CTTGGCGTTG ATATCGATAA CCTGCTGTGT TCTCAGCCGG ATACCGGTGA 150 

ACAAGCGCTG GAAATCTGTG ATGCACTGGC GCGCTCCGGT GCGGTTGACG 200 

TTATCGTTGT CGACTCCGTT GCCGCATTGA CGCCGAAGGC AGAAATCGAA 250 

GGCGAGATTG GCGACTCTCA CATGGGCCTT GCGGCGCGTA TGATGAGCCA 3 00 

GGCGATGCGT AAGCTGGCCG GTAACCTGAA GCAGTCCAAC ACGCTGCTGA 3 50 

TTTTCATCAA CCAGATCCGT ATGAAGATTG GCGTTATGTT CGGTAACCCG 4 00 

GAAACCACCA CC 412 



2) INFORMATION FOR SEQ ID NO: 2203 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 337 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 

(B) STRAIN: 20:1.2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2203 

TTTAGATCCA TCTTATGCTA GAAATTTAGG TGTTGATATA GATAACCTAA 50 

TAGTTTCTCA ACCAGATACA GGAGAACAGG CTTTAGAGAT AACAGAAGCT 100 

TTAGTAAGAT CAGGAGCAGT AGATGTTATA GTTGTAGACT CTGTAGCAGC 150 

TTTAGTTCCT AGGGCAGAAA TAGAAGGAGA AATGGGAGAC T CACATGT AG 200 

GTCTTCAAGC AAGACTTATG TCTCAAGCCC TAAGAAAATT AGCAGGATCT 250 

ATAAATAAAT CTAAGTGTGT AGCTATATTT ATAAACCAAT TAAGAGAAAA 3 00 

GGTTGGTATA ATGTTTGGAA ATCCAGAAAC AACTCCT 337 



2) INFORMATION FOR SEQ ID NO: 2204 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Francisella tularensis 

(B) STRAIN : ATCC 29684 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2204 

AAAGCAAGGC GGTACTGCAG CATTTGTTGA TGCTGAGCAT GCACTAGATC 50 

CAAAATATGC AAAGCTTTTA GGTGTTGATG TTGATAATCT GATCGTGTCA 100 

CAGCCGGATA CGGGTGAGCA AGCTTTAGAG ATTGCTGATA TGTTGGTACG 150 

TTCTGGAGGA GTTGATATTG TAGTAATTGA CTCTGTTGCT GCACTTACGC 20 0 

CAAAGGCAGA GATTGAGGGT GACATGGGCG ACTCGCACAT GGGCTTACAA 2 50 

GCAAGATTAA TGTCACAAGC ACTAAGAAAA CTAACGGCAA ATATCAAGCG 300 

CTCAAATACT CTAGTGATAT TCATTAACCA AATTCGTATG AAGATCGGGG 3 50 

TTATGTTTGG TAACCCTGAA ACTACAACT 3 79 



2) INFORMATION FOR SEQ ID NO: 2205 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 

(B) STRAIN: ATCC 27337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2205 



ACTTGACCCA GTATATGCAA GGGCTCTTGG AGTGGATATA GACAACCTAG- 50 

TCATATCTCA GCCAGATACA GGAGAACAGG CCCTAGATAT AGCAGAGTCC - 100 

CTTATAAGAT CAGGAGCTGT AGATATACTA GTAATAGACT CAGTAGCTGC - 150 

CCTAGTACCT AAGGCAGAAA TAGAAGGTGA CATGGGAGAT TCTCACGTAG - 200 

GTCTACAGGC TAGACTTATG TCACAGGCAC TTAGAAAATT GACTGGATCT - 250 

ATAAAGAAGT CAAACTGTGT TGTTATATTT ATCAACCAGT TG AG AGAAAA - 3 00 

AGTAGGGGTT ATGTTCGGTA ATCCAGAGAC AACAACA 337 



2) INFORMATION FOR SEQ ID NO: 2206 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus asaccharolyticus 

(B) STRAIN: LSPQ 2639 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2206 



TCTTGATGCT GGATATGCAA AAAACCTTGG AGTAGATGTA GAAAATTTAA - 50 

TTATTTCTCA ACCTGATACA GGTGAGCAAG CCTTAGAAAT AACTGAAGCT- 100 

CTTGTAAGAT CTAACGCTGT TGATTTAATT ATTATAGACT CAGTTGCCGC - 150 

ACTTGTACCA AAAGCAGAAA TCGATGGTGA CATGGGAGCT GC ACAAATAG - 2 00 

GTCTTCAAGC AAGACTTATG TCTCAAGCTC TTAGAAAATT AACTGGGGCA- 250 

ATCAACAAGT CAAAATGTAC CGTTGTATTT ATTAACCAAC TT AG AG AAAA - 3 00 

AGTTGGTATC ATGTTTGGTA ACCCAGAAAC TACAACA 3 37 



2) INFORMATION FOR SEQ ID NO: 2207 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia stuartii 
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(B) STRAIN: ATCC 33672 
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(xi) SEQUENCE DESCRIPTION: SEQ 

CTCACGTTGC AAGTTATTGC AGCAGCACAA 
ATTTATCGAC GCTGAACATG CGCTAGATCC 
GTGTTGATAT CGATAACCTT CTATGTTCTC 
GCATTAGAGA TTTGTGATGC ACTGACGCGT 
TATCGTTGAC TCCGTGGCCG CATTAACACC 
AAATCGGTGA CTCACACATG GGCTTAGCGG 
ATGCGTAAAT TAGCGGGTAA CTTAAAGAAC 
CATTAACCAA ATCCGTATGA AGATTGGCGT 
CCACTACA 



ID NO: 2207 

CGTAGCGGAA AAACCTGTGC 50 

AATCTATGCG AAAAAACTGG 10 0 

AACCTGATAC TGGTGAGCAA 15 0 

TCAGGCGCTG TTGATGTCAT 2 00 

AAAAGCTGAA ATTGAAGGTG 250 

CTCGTATGAT GAGCCAAGCG 300 

TCGAATACAC TTTTAATCTT 350 

TATGTTTGGT AACCCAGAAA 400 

408 



2) INFORMATION FOR SEQ ID NO: 2208 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 03 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi A 

(B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2208 

GCTGCAGGTG ATTGCCGCTG CGCAGCGTGA AGGTAAAACC TGTGCGTTTA 50 

TCGATGCGGA ACACGCGCTT GACCCTGTTT ACGCACGCAA GCTGGGCGTC 100 

GATATCGATA ACCTGCTTTG TTCTCAGCCG GATACCGGCG AGCAGGCGCT 150 

GGAAATCTGT GACGCGCTGG CGCGTTCAGG CGCGGTGGAC GTCATTGTGG 200 

TCGACTCCGT AGCGGCGCTA ACGCCGAAAG CGGAAATCGA AGGCGAAATT 250 

GGCGACTCTC ACATGGGCCT CGCGGCGCGT ATGATGAGCC AGGCGATGCG 3 00 

TAAGCTGGCG GGGAACCTAA AACAGTCCAA CACGCTGTTG ATTTTCATCA 3 50 

ACCAGATCCG TATGAAGATT GGCGTGATGT TCGGTAACCC GGAAACCACC 4 00 

ACC 403 



2) INFORMATION FOR SEQ ID NO: 2209 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
{ D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Salmonella 
serotype Typhimurium 

(B) STRAIN: ATCC 14028 

(xi) SEQUENCE DESCRIPTION: SEQ 

GACCCTGACG CTGCAGGTGA TTGCCGCTGC 
GTGCGTTTAT CGATGCGGAA CACGCGCTTG 
CTGGGCGTCG ATATCGATAA CCTGCTCTGC 
GCAGGCGCTG GAAATCTGTG ACGCGCTGGC 
TCATTGTGGT CGACTCCGTA GCGGCGCTAA 
GGCGAAATCG GCGACTCTCA CATGGGCCTC 
GGCGATGCGT AAGCTGGCGG GGAACCTGAA 
TTTTCATCAA CCAGATCCGT ATGAAGATTG 
GAAACCACCA CC 



PCT/CA00/01150 

choleraesuis subsp. choleraesuis 



ID NO: 2209 

GCAGCGTGAA GGTAAAACCT 50 

ACCCTGTTTA CGCACGCAAG 10 0 

TCTCAGCCGG ATACCGGCGA 150 

GCGTTCAGGC GCGGTGGACG 200 

CGCCGAAAGC GGAAATCGAA 250 

GCGGCGCGTA TGATGAGCCA 3 00 

ACAGTCCAAC ACGCTGTTGA 350 

GCGTGATGTT CGGTAACCCG 4 00 

412 



2) INFORMATION FOR SEQ ID NO: 2210 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophytxcus 

(B) STRAIN: ATCC 15305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2210 

ATTAGATCCT GTTTACGCAG AAGCTTTAGG CGTAGATATA CAAAATTTAT- 5 0 

ATTTATCTCA ACCTGATCAT GGGGAACAAG GTTTAGAAAT TGCCGAAGCA- 100 

TTTGTTAGAA GTGGCGCTGT TGATATCGTT GTGGTCGATT CAGTTGCTGC - 150 

GCTTACACCT AAAGCTGAAA TTGAAGGTGA AATGGGAGAT ACGCACGTTG - 200 

GTTTGCAAGC ACGTCTTATG TCCCAAGCCT TGAGAAAGCT TTCCGGTGCA- 250 

ATTTCAAAAT CAAATACAAC AGCAGTATTT ATCAACCAAA TCCGTGAAAA- 3 00 

AGTTGGTGTG ATGTTCGGTA ATCCTGAAGT TACACCA 337 



2) INFORMATION FOR SEQ ID NO: 2211 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 
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(xi) SEQUENCE DESCRIPTION: SEQ 

GACACTGACA TTACAGGTTA TCGCCGCCGC 
GTGCATTTAT CGATGCCGAA CATGCCCTTG 
TTGGGTGTAG ATATTGATAA CCTACTGTGT 
GCAGGCACTG GAAATTTGTG ATGCGCTGAC 
TTATCATCGT TGACTCCGTA GCGGCATTGA 
GGTGAAATTG GCGATTCTCA TATGGGCCTT 
GGCTATGCGT AAGCTGGCGG GTAACCTGAA 
TTTTTATCAA CCAAATCCGC ATGAAAATTG 
GAAACCACTA CC 



ID NO: 2211 

ACAGCGTGAA GGCAAAACGT 50 

ACCCAATCTA TGCCAAGAAA 10 0 

TCTCAGCCAG ATACTGGCGA 150 

TCGCTCTGGT GCGGTTGACG 2 00 

CACCAAAAGC TGAAATTGAA 2 50 

GCCGCGCGTA TGATGAGCCA 3 00 

GAATGCGAAT ACCTTACTGA 3 50 

GCGTGATGTT TGGTAACCCA 4 00 

412 



2) INFORMATION FOR SEQ ID NO: 2212 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2212 

CACGCTCACC TTGCAAACCA TCGCGGAAAT GCAAAAACTG GGCGGCACCT 5 0 

GCGCGTTTAT CGACGCCGAG CACGCACTGG ACGTCACGTA CGCGCAAAAG 100 

CTGGGCGTCA ACCTGAGCGA TCTGCTGATC TCGCAACCGG ACACCGGCGA 15 0 

ACAAGCGCTG GAAATCTGCG ACGCCCTGGT GCGTTCCGGT TCGGTGGACA 200 

TGGTCGTGAT CGACTCGGTC GCCGCGCTGA CCCCGCGCGC CGAGATCGAA 25 0 

GGCGACATGG GCGATTCGCT GCCAGGTTTG CAGGCACGTT TGATGTCGCA 3 00 

AGCACTGCGC AAGCTTACCG GTTCGATCAA CCGCACCAAC ACCCTGGTCA 350 

TCTTCATCAA CCAGATCCGC ATGAAAATCG GCGTCATGTT CGGCAGCCCG 40 0 

GAAA 404 



2) INFORMATION FOR SEQ ID NO: 2213 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2213 
CGTGCCATTG ACATGATTTC CGAAGAAGAC GCTGAAGGCA CG 
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2) INFORMATION FOR SEQ ID NO: 2214 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2214 

CAACTACATT ACGTTCTGCA ACACAAGGTC GTGGTACTTT CAGTATGACA 50 
TTTGACCACT ATGAAGATGT TCCTAAGAGC ATTGCAGAAG AAATCATCAA 100 
GAAAAATGGC GGTAACGGAG AATAA 12 5 



2) INFORMATION FOR SEQ ID NO: 2215 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 bases 

(B) TYPE: Nucleic acid 
<C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: AcinetoJbacter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2215 

CGACTCAAAT GCGTTCTATG TCTCAAGGTC GTGCGACATA CTCAATGGAA 50 
TTTGCTAAAT ATGCTGAAAC TCCACGTAAC GTGGCTGAAG GCATCATCGC 10 0 

TAAATTCCAA GCTGGCGGTA AAAAAGGTGA CGACGAGTAA 140 



2) INFORMATION FOR SEQ ID NO: 2216 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2216 

CCGGTGACCT GCGTTCTAAG ACGCAGGGTC GCGCTGTCTA CTCCATGGAG 
TTCGACAGCT ACGCCGAGGT TCCGCGCGCG GTCGCGGATG AGATCGTCGG 
CAAGTCTCGG GGCAACTGA 



2) INFORMATION FOR SEQ ID NO: 2217 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2217 

CGACCGACGT TCGCTCCATG TCCCAGGGTC GCGCAAGCTA CTCTATGGAA 
TTCAAAAAAT ACAACACAGC TCCGGCGCAC ATCGCTGAAA CTGTATCCAA 
AAAACAAGGC TGA 



2) INFORMATION FOR SEQ ID NO: 2218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2218 

GCGACCTGCG TTCCCGTACC CAGGGCCGTG CAAACTACAC CATGATCTTC 
GACTCCTACG CTGAGGTTCC TACCAACGTG GCAGCTGAGA TCGTGGCAGA 
GCGCAACGGC ACTGCCTAA 



2) INFORMATION FOR SEQ ID NO: 2219 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2219 

AACTCAGCTG CGTTCTCTGA CCAAAGGTCG TGCATCATAC ACCATGGAAT 50 
TCCTGAAGTA TGATGATGCG CCTAACAACG TTGCTCAGGC CGTTATTGAA 100 
GCCCGTGGTA AGTAA 115 



2) INFORMATION FOR SEQ ID NO: 2220 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2220 

AACTCAGCTG CGTTCTCTGA CCAAAGGTCG TGCATCATAC ACCATGGAAT 
TCCTGAAGTA TGATGATGCG CCGAACAACG TTGCTCAGGC CGTTATTGAA 
GCCCGTGGTA AATAA 



2) INFORMATION FOR SEQ ID NO: 2221 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 
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(xi) SEQUENCE DESCRIPTION: SEQ 

CAACTCACCT TCGTTCAGGT ACGCAAGGTC 
TTTGACCACT ATGAAGAAGT TCCTAAATCT 
AGCTAATGGT GGA 



PCT/CAOO/01150 

ID NO: 2221 

GTGGTGTATA CACTATGCAA 50 
ATTGCTGAAG AAATCATTAA 100 

113 



2) INFORMATION FOR SEQ ID NO: 2222 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2222 

CGGCGACCTG CGGTCCAAGA CCCAAGGCCG GGCGAACTAC TCCATGGTCT 50 
TCGACTCCTA CGCCGAAGTG CCGGCCAACG TGTCGAAGGA GATCATCGCG 100 
AAGGCGACGG GTCAGTGA 118 

2) INFORMATION FOR SEQ ID NO: 2223 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN : Mgor-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2223 

CCGGCGACCT GCGGTCCAAG ACGCAAGGCC GGGCGAACTA CTCCATGGTG 5 
TTCGACTCGT ACGCCGAAGT TCCGGCGAAC GTGTCGAAGG AGATCATCGC 10 
GAAGGCGACG GGCGAATAG 11 



2) INFORMATION FOR SEQ ID NO: 2224 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D) TOPOLOGY : Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 

(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2224 

CGGCGACCTG CGGTCCAAGA CTCAAGGCCG GGCGAACTAC TCGATGGTGT 
TCGATTCCTA CGCCGAAGTG CCGGCTCAGG TGTCGAAGGA GATCATCGCG 
AAGGCGACTG GCGAGTGA 



2) INFORMATION FOR SEQ ID NO: 2225 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium terrae 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2225 

CGGAGACTTG CGGTCGAAGA CCCAGGGCCG GGCGAACTAC TCCATGGTGT 
TCGACTCCTA CGCCGAAGTG CCGGCGCAGG TGGCGAAGGA GATTATCGCG 
AAGGCAACGG GCGAGTAA 



2) INFORMATION FOR SEQ ID NO: 2226 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222 6 

GACCGACCTG CGTTCTGCAA CCCAAGGCCG CGCTACTTAC TCTATGGAGT 
TCAAGAAATA TTCTGAAGCT CCTGCCCACA TAGCTGCTGC TGTAACTGAA 
GCCCGTAAAG GCTAA 
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2) INFORMATION FOR SEQ ID NO: 2227 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN : ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2227 

AACTTCATTA CGTTCTAACA CGCAAGGTCG CGGTACTTAC ACAATGTACT 
TTGACCACTA TGCAGAAGTT CCTAAATCAA TTGCTGAAGA AATCATCAAG 
AAAAATAAAG GTGAATAA 



2) INFORMATION FOR SEQ ID NO: 2228 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2228 

AACTTCATTA CGTTCTAACA CTCAAGGTCG CGGTACTTAC ACTATGTACT 
TCGATCACTA TGCAGAAGTT CCAAAATCAA TTGCTGATGA TATCATCAAA 
AAAAATAAAG GTGAATAA 



2) INFORMATION FOR SEQ ID NO: 2229 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1630 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

1144 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2229 

CGTTCAATGC GTGTTCTAGA CGGTGCAGTT ATGGTTTACT GTGCTGTGGG 50 

TGGTGTTCAG CCTCAGTCTG AAACCGTTTG GAGACAGGCT CAGAAGTACA 100 

AGGTTCCTCG TATTGCTTTC GTTAATAAGA TGGACCGTAC TGGTGCTAAT 150 

TTCCTACGTG TTGTAGAGCA GATTAAGACC CGTCTAAAGG GTAACCCTGT 2 00 

TCCTCTAATG TTACCTATCG GTAAAGAGGA CAGCTTTGTT GGTGTAGTTG 250 

ACCTAATCAA GCGTAAGGCT ATCGACTGGG ATGAGGCATC TCAGGGTATG 3 00 

AAGTTTGAGT ACGTTGACAT TCCAGCAGAT ATGGTTGAGG AAGTTGAAGA 350 

GTGGCGTGCA AAGCTTGTAG AAGCAGCTGC AGAAGCTAAC GACGAGCTGA 400 

TGGATAAATT CTTCGGTGGT GAAGAGCTGA CCGAGGAAGA GATCAAGGCT 450 

GCTCTACGTG AGCGTACTCT TCGCAACGAA ATTATTCCTA TGTGCTGCGG 500 

TTCAGCATTT AAGAACAAGG GTGTTCAGGC AATGCTTGAC GCTGTTGTTG 550 

AGTATCTTCC ATCTCCAGCA GATGTTCCTG CTGTTGAGGG TAAGACCCTA 600 

ACCGGTGAAG CTGATACTCG TAAGGCTGAC GATAAAGAGC CATTCTCTGC 650 

TTTAGCATTT AAGCTAGCAA ATGACCCATT CGTAGGTAAC TTAACATTCT 700 

TACGTTGCTA CTCAGGCTTT ATTAAGTCTG GTGACACTGT AATGAACTCA 750 

GATAAGCAGA AGCGTGAGCG TTTCGGCCGT CTAGTTCAGA TGCACGCTAA 800 

TGCTCGTAAT GAGGTTAGCG AGGTTTATGC AGGTGACATC GTTGCTGCTA 850 

TTGGTCTGAA GGAAACCGTT ACCGGTGATA CCTTATGTGA CCCAGAGCAT 900 

CCAATCATTC TTGAGTCAAT CGACTTTGCA GAGCCAGTTA TCTCTGTAGC 950 

AGTTGAGCCT AAGACCAAGG ACGATCAGGA GAAGATGGCT CTTGCTTTAC 1000 

AGCGTTTAGC AAAAGAAGAT CCTTCATTCC GCGTTCGTAC AGACGAAGAG 1050 

TCTGGCCAGA CCATTATTTC TGGTATGGGT GAGCTTCACC TAGACATCAT 1100 

TGTTGACCGT CTACGCCGTG AGTTCAAGGT TGAGTGTAAT CAGGGTAAGC 1150 

CACAGGTTGC ATACCGTGAG ACCATTAAGA GCAAGGTTGA ACAGCAAGGT 12 00 

AAGTTTGCTC GTCAGTCTGG TGGTCGTGGT CAGTACGGTG ACTGCTGGTT 1250 

ACGTATGGAA CCTCTTGAGC CAGGTAAGGG CTACGAATTC GTGAATGAGA 1300 

TTGTTGGTGG TGTAATTCCT AAGGAATATA TCCCTGCAAT TGATAAGGGC 1350 

TGTCAGGAGC AGATCGCTAA CGGTGTTCTA GCTGGTTTCC CAGTTGTTGA 14 00 

CATCAAGATC ACTGTATTCG ATGGTTCTTA CCACGAAGTT GACTCTTCAG 1450 

AAATGGCATT CAAGATTGCT GCTTCTATGG CATTCAAAGA GGGCTTCAAG 1500 

AAGGCAAATC CTGTTCTTCT AGAGCCTTTA ATGAAGGTAG AAGTTGATAC 1550 

TCCTGAAGAC TACATGGGTG ACGTTATTGG TGACTTAAAC CGTCGTCGTG 1600 

CTATCGTTGA AGGCATGGAA GATGGTCCTA 1630 



2) INFORMATION FOR SEQ ID NO: 2230 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1662 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tetragenococcus halophilus 

(B) STRAIN: ATCC 33315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2230 
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TTGAAGTGCA ACGTTCATTG CGTGTGCTAG ACGGTTCAGT TACTGTCTTG 50 

GACTCACAAT CAGGTGTTGA ACCACAAACT GAAACAGTTT GGCGTCAAGC 100 

AACAGAATAT CAAGTACCTC GTATTGTATT CTGTAATAAA ATGGATAAGG 150 

TTGGCGCAGA TTTCTTATAT TCTGTCCGAT CATTGCATGA TCGTTTAGAA 2 00 

GCTAATGCAC AACCTATCCA ATTGCCAATT GGTGCTGAAG ATAACTTTGA 250 

AGGTATCATC GACCTTGTGA AGATGAAAGC TGAATTTTAT AAAGATGATT 3 00 

TAGGGACTAC TTTCGAAGAA ACTGAAATCC CAGATGAATA TAAAGAAACA 3 50 

GCTCAAGAAT GGCATAATAA TTTGGTAGAA TCTGTAGCTG ATTTTGATGA 400 

AGATATCATG ATGAAATACT TGGAAGGTGA AGAAATTACA CCTGAAGAGT 450 

TACAAGCAGG TATTCGTAAA GCAACATTAT CTGTTGAATT TTACCCAGTA 500 

TTATGTGGTT CTGCATTTAA AAACAAAGGT GTTCAAATGA TGTTGGATGC 550 

AGTAATTGAT TACTTGCCTT CTCCAACCGA CGTTCCCCCA _ ATTAAAGGGA 6 00 

TCGATCCGAA AACAGATGAA GAAACTGAAC ATCCTGCTGA TGATAGTGAG 650 

CCTTTTTCAT CACTTGCTTT TAAAGTTATG TCAGACCCTT ATGTTGGCCG 700 

CTTAACTTTC TTCCGTGTTT ATTCAGGTGT GTTGGATACA GGTTCTTATG 750 

TATTGAATGC TACTAAGGGT TCACGTGAAC GAATTGGTCG TATTTTGCAA 80 0 

ATGCATGCCA ATTCTCGTTC TGAGATCGAT AAGGTTTATT CAGGTGACAT 850 

TGCAGCTGCT GTAGGCTTGA AGAACACTAC AACAGGGGAT ACCCTTTGTG 90 0 

ATGAGAAAAA TCCAGTTATT TTGGAAACTA TCAACTTCCC TGAACCAGTA 950 

A??CAA^G CTGTTGAACC TAAGTCAAAA GCTGACCAAG ATAAAATGAG 1000 

CGTAGCACTA CAAAAACTTG CAGAAGAAGA CCCATCTTTT AAAGTGGAAA 1050 

c?aa?gctga aactggcgaa actgtaattg ctggtatggg tgaacttcaa iioo 

TTAGACGTTC TTATTGACCG TATGAAGACT GAATTTAAAG TGGATGCCAA 1150 

Sttgotgca ccacaagttt cttatcgtga aactttccgt tcatcaacta 1200 

AAGCTGAAGG GAAATTTATC CGCCAATCTG GTGGTAGAGG TCAATACGGT 1250 

CACGTATGGG TTGAATTTAC TCCAAACGAA GAAGGAGCAG GATTCGAATT 1300 

TAAAAACTCC ATTGTTGGTG GGGTTGTCCC TCGTGACTAT ATACCTGCAG 1350 

TACAAAAAGG ACTGGAAGAC GCCATGGAAA ATGGTGTGTT AGCTGGTTAT 1400 

CCA^GTTG ACGTAAAGGC AGAACTGTTT GATGGTTCTT ACCATGACGT 1450 

CGACTCTAAT GAAACAGCCT TCCGTATTGC GGCTTCAATG TCTCTACGTG 1500 

AAGCTGCGAA AAAGGCAGAT CCAGTTATTC TTGAACCGAT GATGAAAGTA 1550 

ACAATTAGTA TCCCTGAAGA ATATCTAGGT GATATTATGG GACATGTTAC 1600 

AGCTCGTCGT GGTCGTGTTG AAGGAATGGA TGCTCACGGT AATGCACAAA 1650 
CTGTAAATGC GT 



2) INFORMATION FOR SEQ ID NO: 2231 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1652 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Veillonella parvula 

(B) STRAIN: ATCC 10790 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2231 

AAGTAGAACG TTCTCTACGT GTACTTGACG GTTCTGTTGC GGTGTTCAGT 
GCTAAAGGTG GCGTTGAACC TCAATCCGAA ACAGTATGGC GTCAGGCTTC 
?aIctACGGC GTACCTCGTA TCGCTTATGT AAATAAGATG GATACTGTAG 150 
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200 
250 



GTGCTGACTT CTTCAACGTA GTTGACATGA TGAAAGCTCG TTTGGGTGCA 
AATTCCGTAG CTATCCAAGT ACCAATCGGT GCTGAAGATA CTTTCGAAGG 

CATCATTGAC TTGATGACTA TGAAAGCGGA AATTTATAAA TCCGATGACG 3 00 

GTAAAGAATA TGAAATCACT GATATCCCTG CTGAATATCA AGAAGTAGCA 3 50 

GAAGCTCGTC GCGAAATGAT GATCGATGCT ATCGCTGAAA CAGATGATGA 400 

TATCATGATG AAATATTTGG AAGGCGAAGA AATTTCTGTG GAAGAATTGA 450 

AAGCGGCATT GCGTAAAGCT GTTATTGCTA ACCAATTATT CCCAGTTCTT 500 

TGTGGTTCTT CCTATAAAAA TAAAGGTGTT CAAATGTTAT TGGATGCTGT 550 

TATCGATTAC ATGCCAGCTC CAATCGACAT CCCACCTATT AAAGGTGTTG 600 

TTCCTGGTAC TGAAGAAGAA ACAACTCGTC CTTCTTCCGA TGAAGAGCCA 650 

TTCTCTGCAT TGGCATTCAA AATCATGGCT GACCCTTATG TTGGTAAATT 700 

AGCGTTCTTC CGTGTGTACT CCGGTACATT GGAATCTGGC TCCTACGTTT 750 

TCAACTCCAC TAAAGGTAAA AAAGAACGTA TCGGTCGTAT TCTTCAAATG 800 

CACGCTAACT CCCGTAAAGA AATCGAACGC GTATATTCTG GTGACATCGC 850 

TGCGGCGGTT GGCTTAAAGG ATACTACTAC AGGCGACACA TTGTGTGATG 900 

AAAAATCTCC TGTAATCCTT GAGTCCATGG AATTCCCTGA ACCAGTTATC 950 

TCCGTTGCTG TTGAACCTAA AACAAAAGCT G AC C AAG AAA AAATGGGTAC 1000 

AGCTCTTGCT CGTTTGGCAG AAGAAGATCC TACTTTCAAA GTTCGTACTG 1050 

ATGAAGAAAC AGGTCAAACT ATTATCTCTG GTATGGGCGA ACTTCACTTG "00 
GATATCATCG TTGACCGTAT GAACCGTGAA TTCAAAGTAG ATTGTAACGT 
AGGTAAACCT CAAGTAGCAT ACCGCGAAAC TATCCGTAAA GCTGTTAAGG 
TTGAAGGTAA ATTCGTACGT CAATCTGGTG GTCGTGGTCA ATATGGTCAC 

SSgOTTOG AATTgItTCC TCAAGAACCA GGTGCTGGCT TCGAGTTTGA 1300 

AAACAAGGTT otIgGTGGTG CGATTCCTCG TGAATACATC GGACCTGTTG 1350 

JaaScGGTGT TAAAGAAGCT ATGGAATCCG GTGTTATCGC TGGGTACCCT 1400 

ATGGTTGATG TTAAAGTTAT CGTATTTGAT GGTTCTTACC ATGACGTTGA 1450 

C^cIIcSaA I?SgCCTTCA AAATTGCTGG TTCTATGGGC TTCAAAGAAG 1500 

GTGCTCGCAA AGCAGACCCT GCATTGCTTG AACCATATAT GGCTGTAGAA 1550 

OTAGACGTTC CTGAAGAATA CATGGGCGAC GTTATCGGTG ACTTGAACTC 1600 

TCGTCGTGGT CGCATGGACG GCATGGAAGC TCGTAATGGT TCCCAACATA ifiSO 
TC 



1100 
1150 
1200 
1250 
1300 
1350 



1650 
1652 



2) INFORMATION FOR SEQ ID NO: 2232 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1624 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2232 

AGTAGAGCGT TCCATGCGTG TTCTTGACGG CGCGGTAATG GTTTACTGTG 
CAGTTGGTGG TGTTCAGCCA CAGTCTGAAA CCGTATGGCG TCAGGCTAAT 
AAATATAAAG TTCCACGTAT TGCGTTCGTT AACAAAATGG ACCGTATGGG 
TGCGAACTTC CTGCGCGTAG TTGGTCAACT GAAATCTCGC CTTGGTGCGA 
I^CCAGTTCC ACTGCAGTTG GCAATTGGCG CAGAAGAAAA ATTCACCGGT 
a^Scgatc TGGTGAAAAT GAAAGCGATC AACTGGAACG AAGCTGATCA 
GGGCGTGACC TTCGAATATG AAGAAATCCC TGCTGATATG GCTGAACTGG 
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CTGCTGAATG GCACCAGAAT CTGGTTGAAT 
GAGCTGATGG ACAAATACTT GGGTGGCGAA 
CAAGAAAGCT TTACGTCAAC GTGTTCTGAA 
CCTGTGGTTC TGCGTTTAAA AACAAAGGCG 
GTTATTGAGT ACCTGCCTGC ACCAACTGAC 
CTTGGATGAT GGCAAAGATA CTCCGGCTGT 
AGCCGTTCTC TGCTCTGGCG TTCAAAATCG 
AACCTGACGT TCTTCCGCGT GTACTCTGGT 
CGTTCTGAAC TCAGTGAAAT CGCAACGTGA 
AGATGCACGC TAACAAGCGT GAAGAGATCA 
ATCGCAGCCG CTATCGGTCT GAAAGATGTG 
TGACCCGAAT AATCCGATCA TCTTGGAACG 
TAATCTCTGT TGCTGTTGAA CCAAAAACCA 
GGTATGGCTC TGGGGCGTTT GGCGAAAGAA 
GACTGACGAA GAATCTGGTC AGACTATCAT 
ATTTGGATAT CCTGGTTGAC CGTATGCGCC 
AACGTCGGTA AACCTCAGGT TGCGTACCGT 
TAAGGATGTG GAAGGTAAGC ACGCTAAGCA 
ATGGTCATGT TGTTATCGAC ATGTCTCCAT 
TATGAATTCG TCAACGAAAT CGTTGGTGGT 
TCCGGCCGTT GATAAAGGTA TTCAAGAACA 
CAGGTTACCC AGTTGTTGAC GTTAAAGTGC 
CATGACGTTG ACTCCTCAGA ATTGGCATTT 
CTTTAAAGAA GGTTTCAAAC GAGCTAAACC 
TGAAGGTTGA AGTCGAAACC CCTGAAGATT 
GACCTGAACC GTCGTCGCGG TATC 



PCT/CAOO/01150 

CTGCGGCAGA AGCGTCTGAC 4 00 

GAGCTGACCG AAGAAGAAAT 4 50 

AAGCGAAATT ATTCTTGTTA 500 

TACAGGCAAT GCTGGATGCG 550 

GTTGAATCAA TCAACGGCAT 600 

TCGTCATTCT GACGACAAAG 650 

CTACCGACCC ATTCGTGGGT 700 

ATTGTTAATT CCGGTGATAC 750 

ACGCTTAGGT CGTATCGTAC 800 

AAGAAGTTCA CGCCGGTGAT 850 

ACTACGGGTG ACACTTTGTG 900 

TATGGAGTTC CCAGAGCCGG 950 

AAGCTGACCA AGAAAAAATG 1000 

GATCCATCAT TCCGCGTTTG 1050 

CGCTGGTATG GGTGAGTTGC 1100 

GCGAATTTAA CGTGGAAGCA 1150 

GAAACTATCC GCGAAACCGT 1200 

GTCAGGCGGT CGTGGTCAGT 1250 

TGCCACCGGG TGGTGTTGGG 1300 

TCTATTCCTA AAGAATTCAT 1350 

GCTGAAATCT GGCCCTCTGG 1400 

GTCTGCACTA CGGTTCTTAC 1450 

AAATTAGCTG GTTCTATCGC 15 00 

AGTTCTGCTT GAGCCAATCA 1550 

ACATGGGTGA CGTAATGGGC 1600 

1624 



2) INFORMATION FOR SEQ ID NO: 2233 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ra 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ 

GAACGCTCGA TGCGCGTGCT TGACGGTGCT 
GGGCGGTGTT CAGCCACAGT CGGAAACCGT 
ACAAAGTGCC ACGTCTGGCC TTCGTCAACA 
AACTTCTTCA AGGTGTACGA GCAGATGCGT 
GGTCCTGATC CAGATCCCTA TCGGCGCTGA 
TCGATCTGGT CAAGATGAAG GCTATCCTGT 
ATGAAATTCG ACTACGTCGA TATTCCTGCA 
CGAGTGGCGC GAAAAGATGG TTGAAGCTGC 
TGATGAACAA GTACCTGGAA GAAGGCGACC 
CAGGCGCTGC GTACCCGTAC CATCGCTTCG 
CGGTACCGCC TTCAAGAACA AGGGCGTACA 
TCGAATACCT GCCATCGCCA CTGGACATCG 

11 



igera 



ID NO: 2233 






TGCATGGTTT 


ACTGCGCAGT 


50 


GTGGCGTCAG 


GCTAACAAGT 


100 


AGATGGACCG 


TACCGGCGCC 


150 


GCTCGCCTGA 


AGGCCAACCC 


200 


AGACAACTTC 


AAAGGCGTGA 


250 


GGGACGAAGC 


GTCGCAAGGC 


300 


GAGCTGGCTG 


ATTCGGCCGC 


350 


TGCTGAAGCC 


ACCGAAGAGC 


400 


TGACCGAAGC 


CGAGATCAAG 


450 


GAAATCGTTC 


CGATGATGTG 


500 


GGCCATGCTG 


GACGCGGTCA 


550 


ACGATGTCGG 


CGGTACGGAC 


600 
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GAAGACGACC AGCCAACCAC CCGTCGCGCA 
GGCGCTGGCC TTCAAGATCA TGACCGACCC 
TCTTCCGCGT GTACTCGGGC GCCGTCAATT 
TCGGTCAAAG GTCGTAAAGA GCGTCTGGGC 
GAATCAGCGC GAAGAGATCA AAGAAGTGCG 
CGGTTGGCCT GAAAGACGTG ACCACGGGCG 
GCCATCATCA CGCTGGAAAA AATGATCTTC 
GGCAGTCGAG CCAAAAACCA AGGCCGACCA 
TGAACCGCCT GGCACAGGAA GATCCTTCGT 
GAATCGGGCC AGACCATCAT CGGTGGWATG 
TATCGTTGAC CGCATGAAGC GCGAATTCGG 
AGCCACAAGT GGCTTACCGC GAAACGATCC 
GAAGGCAAGT TCGTCAAGCA ATCCGGTGGT 
TGTGCTGAAG ATCGAGCCGC AAGAACCAGG 
ACGCCATCAA GGGCGGTACC GTTCCTCGCG 
AAAGGCGTGC GCGGCACCCT GAACACCGGC 
CGTGGACGTC AAGGTCACGC TGTTCTTCGG 
CGAACGAAAA CGCGTTCCAG ATGGCCGCTT 
TGCCGCAAAG CATCGCCAGT CATTCTGGAG 
GGAAACGCCG GAAGACTACG CCGGTACCGT 
GCCGCGGTAT GGTGCAGGGC ATGGACGAAA 



PCT/CA00/01150 

GCTGACGACG AGAAATTCTC 6 50 

GTTCGTCGGT CAATTGGCCT 700 

CGGGCGACAC CGTGTACAAC 750 

CGTATTCTGC AGATGCACGC 800 

CGCCGGCGAC ATCGCCGCTG 850 

AAACCCTGTG CGATCCGACC 900 

CCTGAGCCTG TGATTCAACA 950 

GGAAAAAATG GGCCTGGCAC 1000 

TCCGCGTGAA GACCGATGAA 1050 

GGCGAGCTGC ACCTGGAAAT 1100 

CGTGGAAGCA ACCGTCGGCA 1150 

GTAAAACCTG CGAAGAATCG 1200 

CGTGGTCAAT ACGGTCACGT 1250 

CAAGGGCTTC GAGTTCGTTG 1300 

AGTACATCCC TGCGGTGGAA 1350 

GTGCTGGCTG GTTACCCGGT 1400 

TTCGTACCAC GATGTGGACT 1450 

CGATGGCATT CAAAGAAGGC 1500 

CCAATGATGG CTGTGGAAGT 1550 

GATGGGCGAC CTGTCGTCCC 1600 

TCCCAG 1636 



2) INFORMATION FOR SEQ ID NO: 2234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aeromonas hydrophila 

(B) STRAIN: ATCC 7966 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2234 

GGCACGAACG ATCTTGCCGG ACGGGCCATC TTCCATACCT TCGATCAGGC 50 

CACGACGACG GTTCAGGTCA CCGATCACGT CGCCCATATA GTCTTCCGGA 100 

GTCTCGACTT CTACTTTCAT GATCGGTTCG AGCAGAACCG GGTTGGCCTT 150 

CATGAAGCCA GCCTTAAAGG CCATGGAAGC AGCGATCTTG AACGCCAGTT 200 

CGGAAGAGTC GACATCGTGG TAGGAACCGA AGTGCAGACG CACGCCCAGA 250 

TCCATAACCG GATAACCTGC CAGCGGGCCG GACTTCAGTT GCTCGCGGAT 300 

ACCCTTGTCA ACACCCGGGA TGAACTCACC AGGAATGACG CCGCCCTTGA 350 

TGTCGTTGAC GAATTCGTAG GCTTTGCCTT CTTCCAGCGG GTACATGTCG 4 00 

ATCACAACGT GACCGTACTG ACCACGACCA CCGGACTGCT TGGCGTGCTT 450 

ACCTTCGATA TCCTTGACGG TGTTACGAAT GGTTTCACGG TAGGCAACCT 500 

GCGGCTTACC TACGTTCGCT TCTACCTTGA ACTCGCGACG CATACGGTCA 550 

ACGATGATGT CCAGGTGCAG CTCACCCATA CCGGCGATGA TGGTTTGGCC 600 

AGACTCTTCG TCAGTCCATA CGCGGAAGGA CGGGTCTTCC TGAGCCAGAC 650 

GGCCCAGAGC CAGGCCCATC TTCTCTTGGT CAGCCTTGGT TTTCGGCTCA 700 

ACTGCGATGG AGATTACCGG TTCCGGGAAT TCCATACGCT CGAGGATGAT 750 

CGGCGCTTTT TCGTCACACA GGGTGTCACC GGTGGTCACG TCTTTCAGAC 800 

CAATGGCGGC AGCGATGTCG CCTGCGCGAA CTTCTTTGAT CTCTTCACGC 850 
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TTGTTGGCGT GCATCTGAAC GATACGGCCA 
AACGGAGTTC AGCACGGAGT CACCGGAGTT 
GGAAGAAGGT CAGGTTGCCT ACGAACGGGT 
AGAGCAGCAA ACGGCTCGTC ATCAGAAGCA 
GCCATCCAGC TTCAGACCGT CGATGGCTGC 
GATAGTCAAC CACGGCATCC AGCATGGCCT 
GCGGAGCCAC AGGTAACCAG GATGATTTCG 
AAGAGCTTTC TTGATCTCTT CCTCGGTCAG 
TTTCCATCAG GTCTTCAGAC GCTTCAGCAG 
CGCATTTCTT GCGCTTTTTC CAGCAGCTCA 
GAAGGATACG CCCTGGTCAG CTTCGCTCCA 
CCAGGTCGAT AACGCCCTTG AAGTTCTCTT 
AGCGGAACCG GGTTACCTTT CAGACGGGTC 
CAGGAAGTTG GCACCGGTAC GGTCCATCTT 
GAACCTTGTA CTTGTTAGCC TGACGCCATA 
ACGCCACHTA CGGCACAGTA CACCATCACG 
GGAACG 



PCT/CA00/01150 

AAACGCTCGC GCTTCTCTTT 900 

AACCACACCG GAGTAAACGC 950 

CGGTAGCAAT CTTGAATGCC 10 00 

TGACGCTCGT CTTTGGTCTC 1050 

TACGTCGGTC GGCGCCGGCA 1100 

GTACGCCCTT GTTCTTGAAC 1150 

TTGTTCAGAA CACGCTGACG 12 00 

TTCTTCACCA CCCAGGTATT 12 50 

CGGCTTCAAC CAGGGTCATG 130 0 

GCCGGGACGT CTTCGTAATC 13 50 

GTTGATGGCT TTCATCTTGA 14 00 

CTGAACCGAT GTTCAGTTGC 14 50 

TTGATGTGCT CAACGCAGCG 150 0 

GTTGACGAAC GCGATACGGG 1550 

CGGTTTCAGA CTGTGGCTGT 160 0 

GCGCCGTCCA GAACACGCAT 1650 

1656 



2) INFORMATION FOR SEQ ID NO: 2235 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2235 

TCTCCTCCCC ATTTGATAAC TACCAAATGA ACGCTATCGA CTGGTTATGT 50 

CAGTCATAAC CAGTTGATTT TTAAGAGAGT TCTTTGGTAT AATTACAATC 100 

GGTAGATACT GTTATAGAAT CTAACAAAAC TCAATTAATA GGAGGAATCA 150 
TTTAA 



155 



2) INFORMATION FOR SEQ ID NO: 2236 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 
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(xi) SEQUENCE DESCRIPTION: SEQ 

TCTTTCGATT ACTATAAGCC CTAACTAATT 
TCATGGAGCG ACCCTCCATG AGTAGTTAAT 



PCT/CA00/01150 

ID NO: 2236 

CTTAGTTAAA AACCAAGTGC 50 
AAAGGAAGAT CATC 94 



2) INFORMATION FOR SEQ ID NO: 2237 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2237 

GTCCCACCGG GGGTGAGTGG GGGCCGGGCA ATCGGCCCCT GCGAGCCGCC 
TCACACACCA GTAATCCCAG TAGGTCTCAT GCCCCATGGG TGTTAAAGTG 
ACACCTAGCC GTAGGCTGAG AATTTCTACC CGAGTCCAGG AGGACGAAAA 



2) INFORMATION FOR SEQ ID NO: 2238 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

• (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2238 
TTCAGTCCTT TAGGCAAGGA GTTAATTGTC 



2) INFORMATION FOR SEQ ID NO: 2239 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2239 

CAACTAGCCT CTCGCTTATC TCCCTATGCA AAATTTCTAG TCAGTTACAG 50 

GGCATAGGGG GAGCGTAGGC GGGGGTAGCG GCTTGCTGAG CACTTCCTCT 100 

ACATCAAAGG GAATGTTGAG CCGGCCGTTA CCCTGTACGA TCCCATCTGG 150 

TTTCTTCGGT GGTTTGATAA ATACCCCGTT GTGACCCTAG GATCATGTAA 200 

CTGGCACAAT GTAAATAGCT GTACTGCCAG GCTGCCGAAT TAGCAGTCAG 2 50 

AAATGTACAG CACTGTCAAC TCGTGGCTGC GAAATCGTAG CCACCACGAA 3 00 

GTCCAGGAGG ACACACA 317 



2) INFORMATION FOR SEQ ID NO: 2240 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2240 

TCCACAGGAT TAAAACCTAA GTCCCGTGCT CTCTCCTGAG GGGAGAGCAC 
TATAGTAAGG AATATAGCC 



2) INFORMATION FOR SEQ ID NO: 2241 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2241 

TCCACAGGAT TAAAACCTAA GTCCCGTGCT CTCTCCGAAG GGGAGAGCAC 5 
TATAGTAAGG AATATAGCC 6 
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2) INFORMATION FOR SEQ ID NO: 2242 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2242 

AACAACAAAG AAGATTAATT GATTTTTTTC GCAACATCAA GTATAACTTT 
AGTTAGAAGT ATTACTTAGT TTAAATTTAA GCTAAGTAAA AAATAATTAT 
CGAATTATCG AGGAGGATAT TTTAAA 



2) INFORMATION FOR SEQ ID NO: 2243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2243 

GTGTCAACTC ACTGGCTCGG AGCCGAGCAA TCGGCTCAGC GAAGGCGACG 
GGTCAGTAGC TACTGGCAGC GGAGTAATCT TGCCGGGTCA TTGGAATGCC 
TTGGGCGCGG CACAACTGAA AACACCAACA CTGCTTTAAC AAGCACCAAC 
TAGTCCAGGA GGACACAGAA 



2) INFORMATION FOR SEQ ID NO: 2244 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: Mgor-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2244 

GGCCGGCAAG CCTGCGAGTA AGCTGACGCG GTTAGCACCG CGGCAAAACC 
AAGAAAAATC AACACTGCTT TTTTAAGCAC CAACAGTCCA GGAGGACAAC 
AAA 



2) INFORMATION FOR SEQ ID NO: 2245 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 

(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2245 

GCCGAGCGTT GCGCGTAAGC TAGCTCGGTT ACCACGGCGG CAAAACTAGA 
AAAACATCAA CACTGCTTTT ATAAGCACCA ACAGTCCAGG AGGACACAGA 
A 



2) INFORMATION FOR SEQ ID NO: 2246 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium terrae 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2246 

CCAGCTTCCC GCCCGTGCCG CTACGGTGGC ACAAACCCAA AAAGATCAAT 
CCTGCTGTAA CCCAGCACCA ACAAGTCCAG GAGGACAAGA A 



2) INFORMATION FOR SEQ ID NO: 2247 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2247 

TCAGGCAAAT AGGCCGTCTG AAAGGCTGAA ATGATTTTTC AGACGGCATT 
GTTCTTTAAT CGATCTTTAA TGTAAAGGAA TTAGCTC 



2) INFORMATION FOR SEQ ID NO: 2248 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2248 

TATAACTTGT TAAGACTAGC TATGCTAGGT TAAAATACAG GTTGAGCTTA 50 

TTTATAAGCT GACATTTTTA TGATTTGATT TTTAGGGGTA AATGCATTAT 10 0 

AAAAGAATTA TAAATTCTTT TATGCTACAC TCAATCAATT TTCTTCTCAT 150 

GATGGTGAGA AACTATCATG AGAGATAAAT TTGAAATAAC TTTTATTAAG 200 

AATAGGAGAG ATTTAATA 218 



2) INFORMATION FOR SEQ ID NO: 2249 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 
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(xi) SEQUENCE DESCRIPTION: SEQ 

TTAAACTTGT TTTAGCTAGA ATTTCTAGGT 
TAATTAAGCT AACATCTTTA TGAATTGATT 
AAATGAATTA TGAATTCTAA CAATCATTAT 
TATCATGAGA GATAATATTG AAATAACTTT 
AATA 



ID NO: 2249 

AAAATACAGC GTAAGCTTAT 50 
TTTTACTGAA AATGCATTAT 100 
GTCTCATGAT GGTGAGAAAC 150 
TACTAGAATA GGAGAGATTT 2 00 

204 



2) INFORMATION FOR SEQ ID NO: 2250 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 
<C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2250 
CCGTCGGATG GTGTCGTATA CCGCGGAGTC GCCGACGG 



2) INFORMATION FOR SEQ ID NO: 2251 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2251 
CGGAGCCGTT CTCGCTGCGT TACATGCTGG TGGCTCCG 



2) INFORMATION FOR SEQ ID NO: 2252 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2252 
CCACATACAG TGTCTCTC 
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2) INFORMATION FOR SEQ ID NO: 2253 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2253 
CATTACCCAA CCGAAAGTA 



2) INFORMATION FOR SEQ ID NO: 2254 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2254 
ACCTGAACAG AGAGAAATG 



2) INFORMATION FOR SEQ ID NO: 2255 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2255 

ATGGCAAAAG AAAAATTTGA CCGTTCAAAA CCACACGTTA ACATTGGTAC 50 

AATCGGCCAC GTTGACCACG GTAAAACAAC ATTAACTGCT GCTATCACAA 100 

CTGTTTTAGC TAAGAAAGGT TTCGCGCAAG CTCAAGATTA CGGTTCAATC 150 

GATAAAGCTC CAGAAGAACG CGAACGTGGT ATCACAATCA ACACTTCTCA 200 

CGTTGAGTAC GAAACAGACA CTCGTCACTA TGCTCACGTT GACTGCCCAG 250 

GACACGCGGA CTACGTTAAA AAC 273 
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2) INFORMATION FOR SEQ ID NO: 2256 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2256 

ATGGCTAAAG CCAAGTTTGA ACGTAATAAA CCACACGTAA ACGTGGGTAC 50 

AATCGGTCAC GTTGACCATG GTAAAACAAC TTTAACTGCT GCGATTGCAA 100 

CAATTTGTGC AAAAACTTAC GGCGGTGAAG CGAAAGATTA CTCACAAATC 150 

GACTCAGCAC CTGAAGAAAA AGCACGTGGT ATTACAATTA ATACATCACA 2 00 

CGTAGAATAC GATTCTCCAA CTCGTCACTA CGCACACGTT GACTGCCCAG 250 

GCCACGCCGA CTACGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2257 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces 

(B) STRAIN: ATCC 35568 

(xi) SEQUENCE DESCRIPTION: SEQ 

GTGGCGAAGG CCAAGTTTGA GCGCACCAAG 
GATTGGTCAC GTTGACCACG GCAAGACGAC 
AGGTGCTGCA TGACAAGTAC CCCGAACTGA 
CAGGTCGACA ACGCTCCCGA GGAGCGCGAT 
CTCTCACGTT GAGTACCAGA CCGAGGCGCG 
CTCCCGGCCA CGCCGACTAC GTCAAGAAC 



meyeri 



ID NO: 2257 

CCGCACGTCA ACATCGGCAC 50 

GCTGACGGCA GCTATCACCA 10 0 

ACGAGTTCAC CCCCTTCGAT 150 

CGTGGCATCA CGATCAACGT 2 00 

TCACTACGCG CACGTTGACG 250 

279 



2) INFORMATION FOR SEQ ID NO: 2258 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 273 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ 

GTGGCTAAAG AAAAATTTGA TCGTTCCCTA 
TATCGGTCAC GTTGACCATG GTAAAACCAC 
GCGTTTGCTC CGAAGTATTC GGTTCCGCAA 
GACAGCGCAC CAGAAGAAAA AGCTCGTGGT 
CGTTGAATAC AACTCGCTGA TCCGTCACTA 
GTCACGCTGA CTATGTGAAG AAC 



difficile 



ID NO: 2258 

CCGCACGTCA ACGTTGGCAC 50 

TCTGACTGCT GCTCTGACTC 100 

TCGTTGATTT CGATAAAATC 150 

ATCACCATCA ACACCGCGCA 200 

CGCTCACGTT GACTGCCCAG 250 

273 



2) INFORMATION FOR SEQ ID NO: 2259 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtherial 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2259 

GTGGCAAAGG CTAAGTTCGA GCGTACCAAG CCGCACGTCA ACATCGGCAC 50 

CATCGGTCAC GTTGACCACG GTAAGACCAC CACCACCGCT GCTATCACCA 10 0 

AGGTTTTGGC AGACGCTTAC CCAGAGCTGA ACGAAGCTTT CGCTTTCGAT 150 

GCCATCGATA AGGCACCGGA AGAGAAAGAG CGTGGTATTA CCATCAACAT 200 

CTCCCACGTG GAGTACCAGA CCGAGAAGCG CCACTACGCA CACGTTGACG 250 

CTCCAGGTCA CGCTGACTAC ATCAAGAAC 279 



2) INFORMATION FOR SEQ ID NO: 2260 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: En terobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2260 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCATG GTAAAACTAC CCTGACTGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTTCTG CTCGTGCATT CGACCAGATC 150 

GATAACGCAC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACCTCTCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCACACGTA GACTGCCCAG 2 50 

GTCACGCCGA CTATGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2261 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA 
TATCGGCCAC GTTGACCATG GTAAAACTAC 
CCGTTCTGGC TAAAACCTAC GGTGGTTCCG 
GATAACGCGC CGGAAGAAAA AGCTCGTGGT 
CGTTGAATAT GACACCCCGA CTCGCCACTA 
GCCACGCCGA CTATGTTAAA AAC 



pneumoniae subsp. pneumoniae 



ID NO: 2261 

CCGCACGTCA ACGTTGGTAC 50 

CCTGACTGCT GCCATCACTA 100 

CTCGCGCATT CGACCAGATC 150 

ATCACCATCA ACACCTCTCA 2 00 

CGCGCACGTA GACTGCCCGG 250 

273 



2) INFORMATION FOR SEQ ID NO: 2262 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2262 

ATGGCAAAAG AAAAATTTGA CCGCTCTAAA CCCCATGTTA ACATTGGTAC 
TATTGGACAC GTTGACCATG GTAAAACAAC TTTAACTGCT GCAATTACAA 

1160 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CTGTACTTGC TAAAAAAGGC 
GATGGTGCTC CAGAAGAAAG 
CGTTGAGTAC CAAACTGACA 
GACATGCCGA TTACGTTAAA 



TATGCTGATG CACAAGCTTA 
AGAACGTGGA ATCACAATCT 
GCCGTCACTA TGCACACGTT 
AAC 



PCT/CA00/01150 

TGACCAAATT 150 
CTACTGCTCA 200 
GACTGCCCAG 250 

273 



2) INFORMATION FOR SEQ ID NO: 2263 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2263 

GTGGCGAAGG CGAAGTTCGA GCGGACGAAG CCGCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC GCTGACCGCG GCTATCACCA 100 

AGGTTCTGCA CGACAAGTAC CCGGACCTGA ACGAGTCCCG CGCGTTCGAC 150 

CAGATCGACA ACGCGCCCGA GGAGCGTCAG CGCGGTATCA C CAT C AACAT 20 0 

CTCCCACGTG GAGTACCAGA CCGACAAGCG GCACTACGCT CACGTCGACG 250 

CCCCGGGTCA CGCCGACTAC ATCAAGAAC 279 

2) INFORMATION FOR SEQ ID NO: 2264 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: Mgor-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2264 

GTGGCGAAGG CGAAGTTCCA GCGGACCAAG CCGCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC TCTGACCGCG GCTATCACCA 100 

AGGTCCTGCA CGACAAGTAC CCGGACCTGA ACGAGTCCAA GGCGTTCGAC 150 

CAGATCGACA ACGCGCCTGA GGAGCGTCAG CGCGGTATCA CGATCAACAT 200 

CGCGCACGTG GAATACCAGA CCGAGAAGCG TCACTACGCG CACGTCGACG 2 50 

CCCCCGGCCA CGCCGACTAC ATCAAGAAC 279 



2) INFORMATION FOR SEQ ID NO: 2265 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 

(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2265 

GTGGCGAAGG CGAAGTTCCA GCGGACCAAG CCCCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC CCTGACCGCG GCTATCACCA 100 

AGGTCCTGCA TGACAAGTTC CCGGACCTGA ACGAGTCGAA GGCGTTCGAC 150 

CAGATCGACA ACGCTCCTGA GGAGCGCCAG CGCGGTATCA CGATCAACAT 20 0 

CGCGCACGTG GAGTACCAGA CCGAGAAGCG GCACTATGCA CACGTCGACG 250 

CGCCGGGCCA CGCCGACTAC ATCAAGAAC 27 9 



2) INFORMATION FOR SEQ ID NO: 2266 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium terrae 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2266 

GTGGCGAAGG CGAAGTTCGA GCGGACGAAG CCGCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC GCTGACCGCG GCTATCACCA 100 

AGGTTCTGCA CGACAAGTAC CCGGACCTCA ACGAGTCGCG TGCGTTCGAC 150 

CAGATCGACA ACGCTCCCGA AGAGCGTCAG CGCGGTATCA CCATCAACAT 2 00 

CTCCCACGTG GAGTACCAGA CCGAGAAGCG GCACTACGCC CACGTCGACG 250 

CTCCTGGTCA CGCTGACTAC ATCAAGAAC 279 



2) INFORMATION FOR SEQ ID NO: 2267 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2267 

ATGGCTAAGG AAAAATTCGA ACGTAGCAAA CCGCACGTAA ACGTTGGCAC 50 

CATCGGTCAC GTTGACCATG GTAAAACCAC TCTGACTGCT GCTTTGACTA 10 0 

CTATTTTGGC TAAAAAATTC GGCGGTGCTG CAAAAGCTTA CGACCAAATC 150 

GACAACGCAC CCGAAGAAAA AGCACGCGGT ATTACCATTA ACACCTCGCA 20 0 

CGTAGAATAC GAAACCGAAA CCCGCCACTA CGCACACGTA GACTGCCCGG 2 50 

GTCACGCCGA CTACGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2268 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2268 

ATGGCAAAAG AAAAATTTGA TCGCTCAAAA GAACATGCCA ATATTGGTAC 50 

TATCGGTCAC GTTGACCATG GTAAAACAAC TTTAACAGCT GCTATCGCAA 100 

CTGTATTAGC TAAAAATGGT GACACTGTTG CACAATCATA CGATATGATT 15 

GACAACGCTC CAGAAGAAAA AGAACGTGGT ATTACAAT C A ATACTGCACA 2 0 

TATCGAATAC CAAACTGACA AACGTCACTA TGCTCACGTT GACTGCCCAG 25 

GACACGCTGA CTATGTTAAA AAC 27 



2) INFORMATION FOR SEQ ID NO: 22 69 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2269 
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ATGGCAAAAG AAAAATTTGA TCGCTCAAAA 
TATCGGTCAC GTTGACCATG GTAAAACTAC 
CTGTATTAGC TAAAAATGGT GACACTGTAG 
GACAACGCTC CAGAAGAAAA AGAACGTGGT 
CATCGAGTAT CAAACTGACA AACGTCACTA 
GACACGCTGA CTATGTTAAA AAC 



PCT/CA00/01150 

GAACATGCCA ATATTGGTAC 50 
TTTAACAGCT GCTATCGCAA 100 
CACAATCATA TGACATGATT 150 
ATCACAATCA ATACTGCACA 200 
TGCTCACGTT GACTGCCCAG 250 

273 



2) INFORMATION FOR SEQ ID NO: 2270 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aeromonas hydrophila 

(B) STRAIN: ATCC 7966 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2270 

CGATCCTGGT AGTAGCAGCG ACTGACGGCC CGATGCCGCA GACTCGTGAG 50 

CACATCCTGC TGGGTCGTCA GGTAGGCGTT CCGTACATCA TCGTGTTCAT 10 0 

GAACAAGTGT GACATGGTAG ATGACGAAGA GCTGCTGGAA CTGGTCGAGA 150 

TGGAAGTTCG CGAACTGCTG TCCGAGTACG ACTTCCCGGG TGATGACCTG 20 0 

CCGGTAGTCC GTGGTTCYGC ACTGAAAGCG CTGGAAGGCG AAGCTCAGTG 250 

GGAAGAGAAG ATCCTGGAAC TGGCTGGCCA CCTGGACACC TACATTCCGG 300 

AGCCGGAGCG TGCCATCGAC CTGCCGTTCC TGATGCCTAT CGAAGACGTA 350 

TTCTCCATCG CTGGCCGYGG TACCGTAGTG ACCGGTCGTG TAGAGCGCGG 4 00 

TATCGTCAAA GTTGGTGAAG AAGTGGAAAT CGTKGGTATC AAAGATACCA 450 

CCAAGACCAC CTGTACCGGC GTTGAAATGT TCCGCAAACT GCTGGACGAA 500 

GGTCGTGCAG GCGAGAACAT CGGTGCACTG CTGCGTGGCG TGAAGCGTGA 550 

AGACGTAGAG CGTGGTCAGG TACTGGCCAA GCCGGGCACC ATCAAGCCGC 600 

ACACCAAGTT YGAATCTGAA GTGTACGTGC TGTCCAAAGA AGAAGGTGGT 650 

CGTCATACCC CGTTCTTCAA AGGCTACCGT CCGCAGTTCT ACTTCCGTAC 700 

TACCGACGTG ACCGGTACCA TCGAACTGCC GGAAGGCGTA GAGATGGTAA 750 

TGCCGGGCGA CAACATCAAG ATGGTTGTTA CCCTGATTGC GCCGATCGCG 800 

ATGGACGACG GC 812 



2) INFORMATION FOR SEQ ID NO: 2271 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bilophila wadsworthia 
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STRAIN: ATCC 4 9260 



(xi) SEQUENCE DESCRIPTION 



CGACGGTCCC 
TCGGCGTGCC 
GACCCCGAAC 
CTCCTACGGC 
TGAAGGCTCT 
CTCGAACTGC 
AACCGACAAG 
GCCGCGGTAC 
GGCGAAGAAG 
CACCGGCGTC 
ACAACATCGG 
GGCCAGGTTC 
GGCTGAAGTG 
TCTTCACCGG 
GGTATCATCG 
CGCTACCTTT 



ATGCCCCAGA 
TCACCTCGTC 
TGCTCGAACT 
TACCCCGGCG 
GGAATCCGAT 
TCGCCGCTTG 
CCCTTCCTGA 
CGTGGTCACC 
TCGAAATCGT 
GAAATGTTCC 
CGCTCTGCTC 
TCGCCGCTCC 
TACGTTCTGT 
CTATCGTCCT 
CTCTTGAAGA 
AATGTCGAGC 



CCCGTGAGCA 
GTGTTCATGA 
CGTCGAAATG 
ATGAAATCCC 
AGCGCTGATT 
CGACAGCTAC 
TGCCCATCGA 
GGTCGTGTGG 
GGGTATCCGT 
GCAAGCTGCT 
CGCGGCACGA 
CAAGAGCATC 
CCAAGGAAGA 
CAGTTCTACT 
AGGCGTTGAA 
TCATTCACCC 



ID NO: 2271 






CATCCTGCTC 


GCCCGTCAGG 


50 


ACAAGTGTGA 


CCTCGTCGAC 


100 


GAAGTCCGCG 


AACTGCTGAG 


150 


GGTTGTCCGC 


GGTTCCGCTC 


200 


CCCCTGACGC 


CCAGTGCGTG 


250 


TTCCCGGATC 


CGGTCCGCGA 


300 


AGACGTGTTC 


TCCATCTCCG 


350 


AACGTGGCAT 


CATCAAGGTC 


400 


CCCACCGTGA 


AGACGACCTG 


450 


CGATCAGGGC 


CAGGCCGGCG 


500 


AGCGTGACGA 


AGTGGAACGC 


550 


ACGCCCCACA 


AGAAGTTCAA 


600 


AGGCGGCCGC 


CATACCCCGT 


650 


TCCGTACCAC 


CGACATCACC 


700 


ATGGTTATGC 


CCGGCGATAA 


750 


CATCGCCATG 


GAAAAGGGC 


799 



2) INFORMATION FOR SEQ ID NO: 2272 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brevundimonas diminuta 

(B) STRAIN: ATCC 11568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2272 



GATCATGCCG GGCGACAACG CCGAGCTGGA CGTCGA 



ACCCGCGAGC 


50 


GGTGTTCCTG 


100 


TGGTCGAGAT 


150 


GACGACATCC 


200 


CGACCCGCAC 


250 


ACAGCTACAT 


300 


CCGGTGGAAG 


350 


TCGCGTCGAG 


400 


GCATCCGTCC 


450 


AAGCTGCTGG 


500 


CGGCACCAAG 


550 


GTTCGATCAC 


600 


AAGGAAGAAG 


650 


GTTCTACTTC 


700 


GCGTCGAGAT 


750 




786 
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2) INFORMATION FOR SEQ ID NO: 2273 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2273 

GCTATTATGG CTGGATTACT ATCCAATACG CTATCCAAGA ATCCCGTAAC 50 

GTACCTGCCG TCAAATCGCT GGAAGCAGTC GGATTAGATA ATTCATTGAA 100 

ATTCCTCAAT GGCCTTGGTA TCAATTATCC TGAGATGCAT TATTCTAATG 150 

CGATTTCAAG TAATACAAGC GAATCTGGTA ACCAATACGG AGCAAGTAGC 200 

GAAAAAATGG CTGCCGCTTA CGCTGCCTTT GCTAATGGCG GTACATATTA 250 

CAAACCGCAA TACGTCAACC GAGTTGTCTT TAGCGACGGT ACAGAAAAAG 3 00 

TCTTTTCAAA TGGCGGATCA AAAGCCATGA AAGAGACGAC AGCCTACATG 350 

ATGACAGACA TGATGAAGAC CGTTCTTCAG TCTGGAACTG GTACCAATGC 4 00 

TGCAATCCCA GGAGTTTATC AAGCAGGTAA AACTGGTACT TCCAACTATG 4 50 

CAGATGATGA GCTAGAGAAG TTGACAAAAC CTTATTACAG TTCTAGCATT 500 

GTCACACCAG ACGAACTATT TGTTGGCTAT ACTCCACAAT ACTCTATGGC 550 

TGTTTGGACA 560 

2) INFORMATION FOR SEQ ID NO: 2274 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2274 



50 
100 
150 



GCTACTTTGG AAACATTACT GTCCAATATG CTCTTCAACA ATCACGTAAT 
GTCACAGCCG TTGAAACTTT GAATAAGGTC GGTCTAGATA AGGCTAAAGC 
CTTCCTTAAT GGGCTTGGTA TTGATTATCC AAGCATGCAT TATGCAAACG 
CCATTTCAAG TAATACAACT GAATCCAACA AAAAATACGG TGCAAGTAGT 200 
GAAAAAATGG CTGCTGCCTA CGCTGCTTTT GCTAATGGTG GTATTTACCA 250 
CAAGCCAATG TACATCAATA AAATCGTCTT TAGCGACGGT AGTGAGAAAG 300 
AATTTTCTGA TGCCGGCACA CGAGCTATGA AAGAAACTAC TGCCTATATG 350 
ATGACTGAAA TGATGAAAAC AGTCCTAGTA TACGGTACCG GACGTGGAGC 400 
CTACCTACCA TGGCTTCCAC AAGCAGGTAA GACAGGTACT TCTAACTATA 450 
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CTGACGACGA AATTGAAAAG 
GATGAAATGT TTGTAGGGTA 
A 



TATATCAAGA ACACTGGCTA 
TACTCGTAAA TATGCAATGG 
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CGTAGCCCCA 5 00 

CTGTTTGGAC 550 

551 



2) INFORMATION FOR SEQ ID NO: 2275 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2275 

GCTATTATGG CTGGATTACT ATCCAATACG CTATCCAAGA ATCCCGTAAC 50 

GTACCAGCCG TCAAATCGCT GGAAGCAGTC GGATTAGATA ATTCATTGAA 10 0 

GTTCCTCAAT GGCCTTGGTA TTAATTACCC TGAAATGCAT TATTCTAATG 150 

CGATTTCAAG TAATACAAGC GAATCTGGTA ACCAATACGG AGCAAGTAGC 2 00 

GAAAAAATGG CTGCCGCTTA CGCTGCCTTT GCTAATGGCG GTACATATTA 2 50 

CAAACCGCAA TACGTCAACC GAGTTGTCTT TAGCGACGGT ACAGAAAAAG 3 00 

TCTTTTCAAA TGGCGGATCA AAAGCCATGA AAGAAACGAC AGCCTACATG 3 50 

ATGACAGACA TGATGAAGAC CGTTCTTCAA TCTGGAACTG GTAC CAATGC 4 00 

TGCAATTCCA GGAGTCTATC AAGCAGGTAA AACCGGCACT TCCAACTATG 4 50 

CAGATGATGA ACTAGAGAAG TTGACAAAAC CTTATTACAG TTCTAGCATT 50 0 

GTCACACCAG ACGAGCTGTT TGTTGGCTAC ACTCCACAGT ACTCTATGGC 550 
TGTTTGGACA 



560 



2) INFORMATION FOR SEQ ID NO: 2276 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2276 

TTATTTTGGA AATATCACCA TCCAATATGC GCTCCAACAA TCACGGAACG 
TTACAGCCGT AGAAACCTTG AACAAAGTCG GTTTGGATAG AGCCAAGACC 
TTCCTGAATG GAATCGGTAT TGACTATCCA GATATGCACT ATGCCAACGC 
GATTTCAAGT AATACGACTG AGTCAAACAA AAAGTACGGA GCAAGTAGTG 
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AGAAAATGGC TGCTGCTTAC GCTGCTTTTG 
AAACCAATGT ATATCAACAA AATCGTCTTT 
ATACGCTGAT CCTGGTACTC GTGCCATGAA 
TGACAGAAAT GATGAAGACT GTCTTGGCAT 
TATCTCCCTT GGCTACCTCA AGCTGGTAAG 
AGATGATGAA ATTGAAAACT ACATCAAAAA 
ACGAAATGTT TGTTGGTTAT ACTCGCAAAT 



PCT/CA00/01150 

CTAACGGTGG TATCTACCAT 250 

AGCGATGGTA GCTCAAAAGA 30 0 

AGAGACGACC GCCTATATGA 350 

ACGGAACGGG TCGTGGTGCT 400 

ACTGGTACAT CAAACTATAC 450 

TACTGGTTAT GTAGCCCCAG 500 

ATTCAATGGC TGTWTGGACA 550 



2) INFORMATION FOR SEQ ID NO: 2277 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia 

(B) STRAIN: ATCC 35401 

(xi) SEQUENCE DESCRIPTION: SEQ 

GCTCCTATCT GGATTATGCG ATGTCGGTCA 
GATGTCCGAG ATGGCCTGAA GCCGGTACAC 
GAACGTACTA GGCAATGACT GGAACAAAGC 
TCGTTGGTGA CGTAATCGGT AAATACCATC 
TATGACACGA TCGTCCGCAT GGCGCAGCCA 
GGTAGACGGT CAGGGTAACT TCGGTTCTAT 
CAATGCGTTA TACGGAAATC CGTCTGGCGA 
GCCGAT 



coli 



ID NO: 2277 

TTGTTGGCCG TGCGCTGCCA 50 

CGTCGCGTAC TTTACGCCAT 100 

CTATAAAAAA TCTGCCCGTG 150 

CCCATGGTGA CTCGGCGGTC 200 

TTCTCGCTGC GTTATATGCT 250 

CGACGGCGAC TCTGCGGCGG 3 00 

AAATTGCCCA TGAACTGATG 350 

356 



2) INFORMATION FOR SEQ ID NO: 2278 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 347 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 23511 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2278 

CTCTTATCTG GATTATGCGA TGTCGGTCAT TGTTGGCCGC GCGCTGCCGG 
ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT TTACGCCATG 
AACGTATTGG GCAATGACTG GAACAAAGCC TACAAAAAAT CAGCCCGTGT 
CGTTGGTGAC GTGATCGGTA AATACCACCC GCACGGCGAC TCCGCGGTAT 
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ATGACACCAT CGTTCGTATG 
GTGGATGGCC AGGGGAACTT 
AATGCGTTAT ACGGAAATCC 



GCCCAGCCGT TCTCGCTGCG 
CGGTTCAATC GACGGCGACT 
GTCTGGCGAA AATTGCTCAC 



PCT/CA00/01150 

CTACATGCTG 250 
CCGCCGCGGC 3 00 

GAACTGA 347 



2) INFORMATION FOR SEQ ID NO: 2279 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia 

(B) STRAIN: ATCC 43886 

(xi) SEQUENCE DESCRIPTION: SEQ 

AGAGCTCCTA TCTGGATTAT GCGATGTCGG 
CCAGATGTCC GAGATGGCCT GAAGCCGGTA 
CATGAACGTA CTAGGCAATG ACTGGAACAA 
GTGTCGTTGG TGACGTAATC GGTAAATACC 
GTCTATGACA CGATCGTCCG CATGGCGCAG 
GCTGGTAGAC GGTCAGGGTA ACTTCGGTTC 
CGGCAATGCG TTATACGGAA ATCCGTCTGG 
ATGGCCGATC TC 



coli 



ID NO: 2279 

TCATTGTTGG CCGTGCGCTG 50 

CACCGTCGCG TACTTTACGC 10 0 

AGCCTATAAA AAATCTGCCC 150 

ATCCCCATGG TGACTCGGCG 200 

CCATTCTCGC TGCGTTATAT 250 

TATCGACGGC GACTCTGCGG 3 00 

CGAAAATTGC CCATGAACTG 350 

362 



2) INFORMATION FOR SEQ ID NO: 2280 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2280 

CTCCTATCTG GATTATGCGA TGTCGGTCAT TGTTGGCCGT GCGCTGCCAG 50 

ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT TTACGCCATG 10 0 

AACGTACTAG GCAATGACTG GAACAAAGCC TATAAAAAAT CTGCCCGTGT 150 

CGTTGGTGAC GTAATCGGTA AATACCATCC CCATGGTGAC TCGGCGGTTT 200 

ATGACACGAT CGTCCGTATG GCGCAGCCAT TCTCGCTGCG TTACATGCTG 250 

GTAGACGGTC AGGGTAACTT CGGTTCCATC GACGGCGACT CTGCGGCGGC 3 00 

AATGCGTTAT ACGGAAATCC GTCTGGCGAA AATTGCCCAT GAACTGATGG 3 50 

CCGATCTC 358 
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2) INFORMATION FOR SEQ ID NO: 2281 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2281 
CCCCCAGCTG GGCGGCGGTA TCGATGGGGG 



2) INFORMATION FOR SEQ ID NO: 2282 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2282 
AGRRGCIMAR ATGTATGA 

2) INFORMATION FOR SEQ ID NO: 2283 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2283 
ATITATGAYG GKITTCAGAG GC 



2) INFORMATION FOR SEQ ID NO: 2284 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2284 
TCTGWGTRAC IGGYTCKGAG A 



2) INFORMATION FOR SEQ ID NO: 2285 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2285 
CMCCICCWGG TGGWGAWAC 



2) INFORMATION FOR SEQ ID NO: 2286 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228 6 
AGTTGCTGTA TTAGGAAATG 



2) INFORMATION FOR SEQ ID NO: 2287 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2287 
TCGAAGTTGC TGTATTAGGA 



2) INFORMATION FOR SEQ ID NO: 2288 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4339 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2288 

TAGAAGCTGG CTCGTTTTTT TATAAATAAG TTATTCGTTT ATTTTTGTTT 50 

TGTGCTAAAA TATGAGAGTA AATCACTGAA CGATTTAGAA TACAGGAGGA 100 

CAATCTTTTG AAGATTACTT TACTATATGG CGGACGCAGC GCAGAGCAGA 150 

GCATGAAGTG TCCATTCTTT CCGCATTTTC AGTTTTAAAT GCCATTTATT 2 00 

ATAATTATTA CCAAGTTCAA CTCGTATTTA TTACAAAAGA AGGACAATGG 250 

GTCAAAGGTC CATTACTAAC AGAAAAACCT GCCAGCAAAG ATGTCTTGCA 3 00 

TCTTTCATGG GACCCAAGTG GACAGACAGA GGAAGGCTTT ACAGGAAAAG 3 50 

TGATCAATCC GGGCGAAATC AAAGAAGAAG GAGCCATCGT TTTTCCAGTT 4 00 

TTACATGGGC CAAACGGGGA AGATGGAACG ATCCAAGGCT TCTTAGAGAC 4 50 

ATTGAATATG CCTTATGTCG GCGCAGGCGT ATTGACCAGT GCATGTGCCA 500 

TGGATAAAAT CATGACCAAG TATATTTTAC AAGCTGCTGG TGTGCCGCAA 550 

GTTCCTTATG TACCAGTACT TAAGAATCAA TGGAAAGAAA ATCCTAAAAA 600 
AGTATTTGAT CAATGTGAAG GTTCTTTGCT TTATCCGATG TTTGTCAAAC 6 50 

CGGCGAATAT GGGTTCTAGT GTCGGCATTA CAAAAGCAGA AAACCGAGAA 700 
GAGCTGCAAA ATGCTTTAGC AACAGCCTAT CAGTATGATT CTCGAGCAAT 750 
CGTTGAACAA GGAATTGAAG CGCGCGAAAT CGAAGTTGCT GTATTAGGAA 8 00 

ATGAAGACGT TCGGACGACT TTGCCTGGTG AAGTCGTAAA AGACGTAGCA 850 
TTCTATGATT ATGAAGCAAA ATATATCAAT AATAAAATCG AAATGCAGAT 900 
TCCAGCCGAA GTGCCAGAAG AAGTTTATCA AAAAGCGCAA GAGTACGCGA 950 

AGTTAGCTTA CACGATGTTA GGTGGAAGCG GATTGAGCCG GTGCGATTTC 1000 

TTTTTGACAA ATAAAAATGA ATTATTCCTG AATGAATTAA ACTCTATGCC 1050 

AGGATTTACG GAGTTCAGTA TGTACCCACT CTTATGGGAA AATATGGGCT 110 0 

TGAAATACGG TGATTTGATT GAAGAACTGA TCCAGTTAGG AATGAATCGA 1150 

TACCATCAGC GTCAATCTTT TTTTGAAAAA AATGAATAAA GAGAAATAAA 12 00 

GAAGAGGCTG GAGTGATTGC GTAACCGCGT TCATTCTAGC 124 0 



2) INFORMATION FOR SEQ ID NO: 2289 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2289 
CACCGAAGAA GATGAAAAAA 
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2) INFORMATION FOR SEQ ID NO: 2290 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2290 
TGGCACCGAA GAAGATGA 



2) INFORMATION FOR SEQ ID NO: 2291 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2291 
ATTTTGGCAC CGAAGAAGA 
2) INFORMATION FOR SEQ ID NO: 2292 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
<D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2292 
GAATCGGCAA GACAATATG 



2) INFORMATION FOR SEQ ID NO: 2293 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4147 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2293 

ATGAATAGAA TAAAAGTTGC AATACTGTTT GGGGGTTGCT CAGAGGAGCA 5 0 

TGACGTATCG GTAAAATCTG CAATAGAGAT AGCCGCTAAC ATTAATAAAG 100 

AAAAATACGA GCCGTTATAC ATTGGAATTA CGAAATCTGG TGTATGGAAA 150 

ATGTGCGAAA AACCTTGCGC GGAATGGGAA AACGACAATT GCTATTCAGC 2 00 

TGTACTCTCG CCGGATAAAA AAATGCACGG ATTACTTGTT AAAAAGAACC 2 50 

ATGAATATGA AATCAACCAT GTTGATGTAG CATTTTCAGC TTTGCATGGC 3 00 

AAGTCAGGTG AAGATGGATC CATACAAGGT CTGTTTGAAT TGTCCGGTAT 3 50 

CCCTTTTGTA GGCTGCGATA TTCAAAGCTC AGCAATTTGT ATGGACAAAT 4 00 

CGTTGACATA CATCGTTGCG AAAAATGCTG GGATAGCTAC TCCCGCCTTT 450 

TGGGTTATTA ATAAAGATGA TAGGCCGGTG GCAGCTACGT TTACCTATCC 500 

TGTTTTTGTT AAGCCGGCGC GTTCAGGCTC ATCCTTCGGT GTGAAAAAAG 550 

TCAATAGCGC GGACGAATTG GACTACGCAA TTGAATCGGC AAGACAATAT 6 00 

GACAGCAAAA TCTTAATTGA GCAGGCTGTT TCGGGCTGTG AGGTCGGTTG 650 

TGCGGTATTG GGAAACAGTG CCGCGTTAGT TGTTGGCGAG GTGGACCAAA 700 

TCAGGCTGCA GTACGGAATC TTTCGTATTC ATCAGGAAGT CGAGCCGGAA 750 

AAAGGCTCTG AAAACGCAGT TATAACCGTT CCCGCAGACC TTTCAGCAGA 800 

GGAGCGAGGA CGGATACAGG AAACGGCAAA AAAAATATAT AAAGCGCTCG 850 

GCTGTAGAGG TCTAGCCCGT GTGGATATGT TTTTACAAGA TAACGGCCGC 900 

ATTGTACTGA ACGAAGTCAA TACTCTGCCC GGTTTCACGT CATACAGTCG 950 

TTATCCCCGT ATGATGGCCG CTGCAGGTAT TGCACTTCCC GAACTGATTG 100 0 

ACCGCTTGAT CGTATTAGCG TTAAAGGGGT GA 1032 



2) INFORMATION FOR SEQ ID NO: 2294 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 94 
AAACGAGGAT GATTTGATTG 



2) INFORMATION FOR SEQ ID NO: 2295 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2295 
TTGAGCAAGC GATTTCGG 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 2296 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1029 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: V583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2296 

ATGAATAAAA TAAAAGTCGC AATTATCTTC GGCGGTTGCT CGGAGGAACA 50 

TGATGTGTCG GTAAAATCCG CAATAGAAAT TGCTGCGAAC ATTAATACTG 100 

AAAAATTCGA TCCGCACTAC ATCGGAATTA CAAAAAACGG CGTATGGAAG 150 

CTATGCAAGA AGCCATGTAC GGAATGGGAA GCCGATAGTC TCCCCGCCAT 2 00 

ATTCTCCCCG GATAGGAAAA CGCATGGTCT GCTTGTCATG AAAGAAAGAG 2 50 

AATACGAAAC TCGGCGTATT GACGTGGCTT TCCCGGTTTT GCATGGCAAA 3 00 

TGCGGGGAGG ATGGTGCGAT ACAGGGTCTG TTTGAATTGT CTGGTATCCC 350 

CTATGTAGGC TGCGATATTC AAAGCTCCGC AGCTTGCATG GACAAATCAC 4 00 

TGGCCTACAT TCTTACAAAA AATGCGGGCA TCGCCGTCCC CGAATTTCAA 450 

ATGATTGAAA AAGGTGACAA ACCGGAGGCG AGGACGCTTA CCTACCCTGT 500 

CTTTGTGAAG CCGGCACGGT CAGGTTCGTC CTTTGGCGTA ACCAAAGTAA 550 

ACAGTACGGA AGAACTAAAC GCTGCGATAG AAGCAGCAGG ACAATATGAT 600 

GGAAAAATCT TAATTGAGCA AGCGATTTCG GGCTGTGAGG TCGGCTGCGC 650 

GGTCATGGGA AACGAGGATG ATTTGATTGT CGGCGAAGTG GATCAAATCC 700 

GGTTGAGCCA CGGTATCTTC CGCATCCATC AGGAAAACGA GCCGGAAAAA 750 

GGCTCAGAGA ATGCGATGAT TATCGTTCCA GCAGACATTC CGGTCGAGGA 800 

ACGAAATCGG GTGCAAGAAA CGGCAAAGAA AGTATATCGG GTGCTTGGAT 850 

GCAGAGGGCT TGCTCGTGTT GATCTTTTTT TGCAGGAGGA TGGCGGCATC 900 

GTTCTAAACG AGGTCAATAC CCTGCCCGGT TTTACATCGT ACAGCCGCTA 950 

TCCACGCATG GCGGCTGCCG CAGGAATCAC GCTTCCCGCA CTAATTGACA 1000 

GCCTGATTAC ATTGGCGATA GAGAGGTGA 102 9 



2) INFORMATION FOR SEQ ID NO: 2297 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2297 

TTCAGGAGGG GGATCGC 17 



1176 

SUBSTITUTE SHEET (RULE 26) 



